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NEWS 


612 Roundup of the week’s news 


614 TELESCOPE CLASH DEEPLY ROOTED 
IN HAWAII'S PAST 

Opponents say Thirty Meter Telescope 
violates sovereignty and sacred ground 
By I. Loomis and A. Cho 


615 DEEP-OCEAN MICROBE IS CLOSEST 
LIVING RELATIVE OF COMPLEX CELLS 
Genomic study of “Loki” supports 

a revisionist view of the origin of 
eukaryotes By M. Leslie 


616 JAPAN’S ‘NIH’ STARTS WITH MODEST 
FUNDING BUT HIGH AMBITIONS 

Key priority is filling nation’s sparse 
drug pipeline By D. Normile 
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617 WHEN MODERN BIRDS TOOK FLIGHT 
Feathered fossils from China show that 
modern birds could fly and wade earlier 
than expected By M. Balter 


618 NEW DATABASE LINKS REGULATORY 
DNATO ITS TARGET GENES 

Recent big genomics projects aid 
disease studies By E. Pennisi 

> PERSPECTIVE P. 640; RESEARCH ARTICLE 
P.648; REPORTS PP. 660 & 666 


620 ITALY’S OLIVES UNDER SIEGE 
Blight alarms officials across Europe 
By E. Stokstad 


622 KING OF THE PILLS 

Raymond Schinazi’s handful of 
lifesaving drugs has earned him riches, 
esteem, and a dose of enmity By J. Cohen 


626 SAVING CALIFORNIA'S CALVES 

After a half-century of research, scientists 
find a promising solution to mysterious 
abortions in beef cattle By I. Chen 
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Hyperactive astrocytes 
impair brain development 


INSIGHTS 


628 BRAVE NEW WORLD 


632 UNDERSTANDING FRICTION IN 
LAYERED MATERIALS 

A scanning probe technique provides 
a clearer picture of friction at the 
nanometer scale By K. M. Liechti 

> REPORT P. 679 


633 MORE THAN JUST BUGS IN SPIT 
Advances in imaging help to explain 
tuberculosis treatment failures and 
develop better drugs By C. Barry 


635 ACLOSER MIMIC OF THE OXYGEN 
EVOLUTION COMPLEX OF PHOTOSYSTEM II 
An inorganic cluster replicates many of 
the structural aspects of the complex 
that photosplits water and powers 
photosynthesis By L. Sun 

> REPORT P. 690 


636 “RASopathic” ASTROCYTES 
CONSTRAIN NEURAL PLASTICITY 

The cellular pathology of a complex 
neurodevelopmental disorder is teased 
apart By L. Xing et al. 

> SCIENCE TRANSLATIONAL MEDICINE 
RESEARCH ARTICLE BY R. KRENCIK ET AL. 


638 COMMITTING TO ECOLOGICAL 
RESTORATION 

Efforts around the globe need legal and 
policy clarification By K. Suding et al. 


640 GTEx DETECTS GENETIC EFFECTS 
The genetic basis for variation among 
individuals in transcript abundance 
across tissues is analyzed By G. Gibson 
> NEWS STORY P. 618; RESEARCH ARTICLE 

P. 648; REPORTS PP. 660 & 666 
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LETTERS 


642 PROMISES AND PERILS 
FOR THE PANDA 
By J. Liu 


642 NEXTGEN VOICES: LAST CALL 


642 THE POTENTIAL 
OF SECONDARY FORESTS 


By F- Bongers et al. 


643 NAMING DISEASES: 
FIRST DO NO HARM 
By K. Fukuda et al. 


RESEARCH 


IN BRIEF 


644 From Science and other journals 


REVIEW 
647 SOIL SCIENCE 


Soil and human security in the 21st 
century R. Amundson et al. 

REVIEW SUMMARY; FOR FULL TEXT: 
dx.doi.org/10.1126/science.1261071 


RESEARCH ARTICLE 


648 HUMAN GENOMICS 

The Genotype-Tissue Expression (GTEx) 
pilot analysis: Multitissue gene regulation 
in humans The GTEx Consortium 

> NEWS STORY P. 618; PERSPECTIVE P. 640; 
REPORTS PP. 660 & 666 


REPORTS 
HUMAN GENOMICS 


660 The human transcriptome across 
tissues and individuals M. Melé et al. 


666 Effect of predicted protein- 
truncating genetic variants on 
the human transcriptome 
M.A. Rivas et al. 

> NEWS STORY P. 618; PERSPECTIVE P. 640; 

RESEARCH ARTICLE P. 648 


Getting a tighter grasp 
on friction 


670 SUPERNOVAE 

44Ti gamma-ray emission lines from 
SN1987A reveal an asymmetric 
explosion S. E. Boggs et al. 


672 PHYSICS 

Creating and probing electron 
whispering-gallery modes in graphene 
Y. Zhao et al. 


676 GEOPHYSICS 

Migrating tremor off southern Kyushu 
as evidence for slow slip of a shallow 
subduction interface Y. Yamashita et al. 


679 SURFACE SCIENCE 

Adhesion and friction in mesoscopic 
graphite contacts E. Koren et al. 

> PERSPECTIVE P. 632 


683 SOLAR CELLS 

Impact of microstructure on local 
carrier lifetime in perovskite solar cells 
D. W. deQuilettes et al. 


686 CATALYSIS 

Identification of molybdenum oxide 
nanostructures on zeolites for natural 
gas conversion J. Gao et al. 


690 INORGANIC CHEMISTRY 

A synthetic Mn,Ca-cluster mimicking 
the oxygen-evolving center of 
photosynthesis C. Zhang et al. 

> PERSPECTIVE P. 635 


694 VACCINES 

Long-term measles-induced 
immunomodulation increases overall 
childhood infectious disease mortality 
M. J. Mina et al. 

> PODCAST 


699 CHROMOSOMES 

CENP-C reshapes and stabilizes CENP-A 
nucleosomes at the centromere 

S. J. Falk et al. 


704 STRUCTURAL VIROLOGY 
Near-atomic cryo-EM structure of 

the helical measles virus nucleocapsid 
I. Gutsche et al. 


Ahigh-resolution view of 
measles virus capsid protein 


707 OPTOGENETICS 
Engineering of a light-gated 
potassium channel C. Cosentino et al. 


711 MALARIA 

A forward genetic screen identifies 
erythrocyte CD55 as essential for 
Plasmodium falciparum invasion 
E. S. Egan et al. 


DEPARTMENTS 
611 EDITORIAL 


Give women an even chance 
By Marcia McNutt 


726 WORKING LIFE 
The making of a science evangelist 
By Ainissa Ramirez 


ON THE COVER 


A representation 

of how variation in 

the human genome 
affects gene expression 
among individuals and 
tissues. Colors and 
shapes show variations 
between people and 
within individuals. 
The Genotype-Tissue Expression (GTEx) 
Consortium examined postmortem tissue 
to document how genetic variants confer 
differences in gene expression across the 
human body. See pages 618, 640, 648, 660, 
and 666. Illustration: Thomas Danthony 
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EDITORIAL 


Give women an even chance 


n important part of the mentoring role of aca- 
demic advisors is writing letters of recommen- 
dation for worthy students who are applying 
for research grants, fellowships, and entrance 
to elite programs. Such letters often have more 
impact than grades or test scores at the gradu- 
ate level, where factors such as initiative, hard 
work, creativity, independence, problem-solving ability, 
and teamwork are prized 
skills. Well-crafted, honest 
letters help in determin- 
ing whether a candidate is 
a good fit. For that reason, 
I was surprised recently 
when asked to review 60 
proposals from graduate 
students for small research 
grants. More than 10% of 
the applicants had a least 
one supporting letter con- 
taining inappropriate mate- 
rial for the decision at hand. 
All of the students so af- 
fected, unfortunately, were 
women, and those writ- 
ing the problematic letters 
were nearly equally men 
and women. By describing 
how these instances appear 
from the standpoint of a se- 
lection committee member, 
my hope is that the profes- 
sionalism in recommenda- 
tion letters will improve. 

As examples of the sort of problems, one letter 
described how the candidate was so good to her elderly 
mother, yet still enjoyed life, spending time in nature 
with her husband and her animal friends. Another 
letter reflected amazement that the candidate managed 
to balance so efficiently being a student, a scientist, 
and a mother. Such comments do indeed make impres- 
sions on a committee that must select only 1 in 10 for 
an award from a highly competitive pool. They cannot 
help but put a candidate at a disadvantage when com- 
pared to others who are praised for their self-initiated 
research projects, interesting uses of coursework to 
address new scientific problems, or careful background 
preparation for the research project proposed. 

Beyond comments that were simply out of place for 


“.. Nice’ never got me a research 
grant or professional position.” 


the decision at hand, I noted an overall bias in the lan- 
guage used to describe the male candidates versus some 
of the female candidates. In some letters, women were 
described as “friendly,” “kind,” “pleasant,” “humble,” and 
frequently, “nice.” While I am sure they were all of that, 
those were not necessarily the most relevant character- 
istics I was looking for in the next generation of scien- 
tists to advance the frontiers of discovery. Very different 
words were used to describe 
the male candidates (and 
many of the females as 
well): “brilliant,” “creative,” 
“hard-working,” “insightful,” 
and “showing leadership.” 

Studies have documented 
bias in referees’ unblinded 
reviews of manuscripts for 
publication, but this was 
the first time that I noted 
such a bias in the language 
used to assess a student’s 
suitability for a research 
grant—a bias that could 
carry over to future fund- 
ing decisions. I suspect that 
such bias creeps in uninten- 
tionally in most cases, but I 
also understand that gender 
stereotypes persist and that 
they have a basis in reality. 
Women still bear the major- 
ity of caregiver burdens for 
children and elderly par- 
ents. But that information 
does not belong in letters of recommendation. I like to 
think that I am a nice person. But “nice” never got me a 
research grant or professional position. 

Now is the time to promote women as well as 
other underrepresented groups in science. The future 
strength of the global scientific enterprise depends on 
boosting the influx of diverse and talented research- 
ers into science and engineering fields. Subtle bias, 
even unintended, in advancing this cohort can be more 
damaging than outright bias, because it is more dif- 
ficult to detect and correct. I urge all who write these 
important letters of recommendation to take a last 
look before hitting “send” to be sure that what you 
have written is free of bias. 

- Marcia MeNutt 


Marcia McNutt 
Editor-in-Chief 
Science Journals 
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Whale mouths sport bungee-cord nerves 


Feeding humpback whales can open wide thanks to expandable nerves (right). 


hen a humpback whale gets hungry, it pumps its tail and 
fluke to lunge through the briny, prey-laden water with its 
jaw dangling nearly perpendicular to its body. To accom- 
modate that mouthful, the family of baleen whales called 
rorquals (Balaenopteridae)—which includes blue, fin, and 
humpback whales—has evolved unique features such as 
highly flexible jaw joints, a deformable tongue that inverts into a 
sac to hold the seawater, and grooved, expandable blubber on their 
undersides. Now, scientists have identified another necessary adapta- 
tion: Nerves in the whales’ tongue and mouth can extend to more 
than double their original length. Stretching vertebrate nerves nor- 
mally leads to pain, paralysis, or even the detachment of nerve roots 
from the spinal cord. But researchers report this week in Current 
Biology that rorquals’ nerves consist of a core of folded bundles 
of nerve fibers surrounded by a thick wall of folded collagen and 
elastin—the same protein that keeps skin elastic. The nerves unfold 
until the collagen stiffens, preventing overelongation, and the elastin 
snaps the nerves to their previous shape. 
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44[T]he application of reason, more than any other means, 
has proven to offer hope for human survival on Earth. 99 


U.S. Representative Mike Honda (D-CA), introducing a resolution to declare a National 
Day of Reason as a secular alternative to the National Day of Prayer. 


AROUND THE WORLD 


House bill recommends NIH boost 


WASHINGTON, D.c. | The U.S. House 

of Representatives last week released a 
bipartisan draft bill that aims to jump-start 
biomedical innovation with new policies 

at the National Institutes of Health (NIH) 
and the Food and Drug Administration. 
The 21st Century Cures Act, spearheaded 
by Energy and Commerce Committee 
Chair Fred Upton (R-MI) and a senior 
democrat, Diana DeGette (D-CO), 
delighted research advocates with a sug- 
gested NIH budget increase of $10 billion 
over 5 years. An earlier draft released by 
Upton in January failed to win DeGette’s 
support and disappointed NIH advocates 
(Science, 6 February, p. 594). The new ver- 
sion cuts incentives for drug developers and 
creates an NIH innovation fund to support 
“young emerging scientists,’ precision medi- 
cine, and a third unspecified category. 
http://scim.ag/Cures2 


Giant telescope picks U.K. HQ 


JODRELL BANK, U.K. | The countries 
building the Square Kilometre Array 
(SKA), a vast radio telescope to be con- 
structed in South Africa and Australia, 
have turned down the chance of headquar- 
tering it in the historic Castello Carrarese 
in Padua, Italy, in favor of a new purpose- 
built HQ at SKA’s current interim home 
near Manchester, U.K. Castello Carrarese, 
until recently used as a prison, made early 
running in the race to host the HQ. “Both 
[nations] offered substantial financial 
support,” says SKA Director General Philip 
Diamond. But the 11 member countries 
opted for the United Kingdom, which 

will chip in £200 million to the project, 
including the HQ bid and its contribution 
to phase I of construction from 2018 to 
2023. http://scim.ag/SKAhome 


Americas declared rubella-free 


WASHINGTON, D.c. | North and South 
America have become the first region in 
the world to eliminate rubella, the Pan 
American Health Organization (PAHO) 
announced on 29 April. Rubella symptoms 
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A protester in 
Baltimore throws a 
gas canister back at 
police during riots 
on 28 April. 


Three Q’s 


Riots convulsed Baltimore, Maryland's streets last week following the funeral of a 25-year-old African-American man who died 
in police custody. Dan Braha, a social scientist at the New England Complex Systems Institute in Cambridge, Massachusetts, 
has been collecting Twitter data related to the riot as part of a larger study of social media and civil unrest. 


Q: How does Baltimore’s 
rioting compare to that 

in Ferguson, London, 

and elsewhere? 

A: In terms of the communica- 
tions and patterns of spread, 
they are remarkably similar. 
Even if the causes are different, 
there seems to be a universal 
pattern to civil unrest. 


are usually mild, but if contracted in 

early pregnancy, the virus can lead to 
miscarriage or birth defects known as con- 
genital rubella syndrome (CRS). The last 
endemic cases of rubella in the Americas 
were reported in Argentina in 2009, and 
the last case of CRS was reported in Brazil 
the same year. The 15-year-long elimina- 
tion campaign vaccinated an estimated 250 
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Q: What can you learn 
about the Baltimore riots 
from social media? 

A: It's interesting to see the 
pattern of spread, much 

like forest fires, spreading 

in clusters and locally. The 
riots, in my view, could easily 
spread across other cities 

in the United States where 


tipping point? 


million adolescents and adults in 
32 countries and territories in the Americas, 
said PAHO Director Carissa Etienne. 


Researcher drops primate work 


TUBINGEN, GERMANY | A researcher 
targeted by animal rights activists is giving 
up his work on primates. Nikos Logothetis, 
a director at the Max Planck Institute for 
Biological Cybernetics, says that he will 
conclude his experiments on macaques “as 
quickly as possible” and focus future work 
on rodents. In a 28 April letter, he cited the 
failure of the Max Planck Society (MPS) 
and other scientific organizations to press 
criminal charges against aggressive activ- 
ists as a factor in his decision. Logothetis’s 
research was the subject of a German 
television program in September that 
included footage filmed by an undercover 
activist. MPS says it was advised not to 
bring charges against the activist. 
http://scim.ag/Logothetis 
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racial tensions are high and 
are close to a tipping point. 


Q: How do you spot this 


A: There are three basic 

data from tweets: location, 
time, and intensity. First, you 
create a network of commu- 
nication, where the nodes are 


people or groups. Then, you 
correlate the communication 
patterns with characteristics 
such as gender [and] political 
attitudes. ... It is the intensity 
of the communication— 

how many messages over 
time—that predicts what is 
to come. Full interview at 
http://scim.ag/_riots. 


California climate goals 


SACRAMENTO | California has raised 

the bar for global climate change action. 
The state, halfway to its goal of cutting 
greenhouse gas emissions to 1990 levels by 
2020, now aims to drive carbon pollution 
40% below that mark by 2030. Governor 
Jerry Brown signed an executive order on 
29 April matching the European Union’s 
ambitious pledge for international climate 
change negotiations, to be finalized in 
Paris this December. California, unlike 
Europe, has created jobs while slashing 
carbon. The move by the world’s eighth 
largest economy will inspire nations to 
aim high on climate action, Brown’s sup- 
porters say. Critics maintain California 
business will suffer if the state is too 

far ahead of the pack. Indeed, because 
California generates less than 2% of global 
greenhouse gas emissions, Brown’s move 
helps the planet only if others follow. 
http://scim.ag/climateCA 
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Protesters, hereflying y 
Hawaiian flags, have 
halted construction of 
one of the world’s large 
optical telescopes. 


Telescope clash deeply rooted in Hawait’s past 


Opponents say Thirty Meter Telescope violates sovereignty and sacred ground 


By lima Loomis and Adrian Cho 


month after protests halted work on 

the Thirty Meter Telescope (TMT) on 

Hawaii’s Big Island, the impasse over 

what would be one of the world’s larg- 

est optical telescopes has deepened. 

Last week, as protesters continue to 
occupy the construction site on Mauna Kea, 
a 4205-meter dormant volcano, a state panel 
representing Native Hawaiians pulled its 
support for the project. 

The 30 April decision by the quasi- 
governmental Office of Hawaiian Affairs 
(OHA) underscores a deeper challenge for 
the TMT. Besides offending cultural and 
religious sensitivities by occupying a site 
on a sacred mountain, the project—led by 
an international consortium based in Pasa- 
dena, California—has gotten snagged on 
the thorny issue of Hawaiian sovereignty. 
“It comes down to the fact that there is an 
occupation of Hawaii by the United States,” 
says Anne Keala Kelly, an independent film- 
maker in Honolulu, who is making a docu- 
mentary about the protests. 

Managers of the $1.4 billion project still 
have the permits, issued 6 March by the 
state’s Department of Land and Natural Re- 
sources, to start construction. But they have 
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held off since 31 protesters were arrested on 
2 April. The TMT team worked for years to 
address Native Hawaiians’ concerns, says 
Sandra Dawson, TMT’s manager for Hawaii 
community affairs, so researchers were taken 
aback by the flare-up, especially on social me- 
dia. “Some of the people we're seeing now are 
people we've never seen before,” she says. 

TMT collaborators say they have strived 
to be respectful of Mauna Kea, which 
many Native Hawaiians consider the 
“piko,” or navel, that connects the island 
to Wakea, the Sky Father. Planners chose 
a site 193 meters below the sacred summit 
that has no known archaeological remains, 
researchers say. Opponents of the TMT say 
that is not much help, given that 13 tele- 
scopes already dot Mauna Kea. “It’s like an 
industrial park up there,” Kelly says. “This 
is cultural strip mining.” 

More important, opponents say, the TMT 
is emblematic of the greater dispossession of 
Native Hawaiians, which dates back to 1893, 
when the United States claimed the islands. 
“You can say, ‘Well, it’s a cultural issue; but 
it’s more than that,” Kelly says. 

Such sentiments struck a chord with the 
trustees of OHA, whose mission is to pro- 
tect and perpetuate Hawaiian culture. “Self- 
determination is right at our fingertips,” said 
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OHA Trustee Dan Ahuna before last week’s 
vote. “We have the opportunity to send a 
strong message that it is no longer business 
as usual for Hawaiians.” 

OHA’s vote withdraws its 2009 endorse- 
ment of the TMT. In a meeting in Honolulu, 
the board voted 6 to 0 to rescind support, 
with one trustee abstaining and two ab- 
sent. Still, many Hawaiian groups felt the 
vote didn’t go far enough and that the board 
should have expressed clear opposition to the 
project. A vote rejecting that stronger stance 
was met with shouts of “aole!” or “no!” from 
a standing-room-only crowd. 

“We're disappointed that OHA cannot 
come out in full support of their people, 
their constituents, their lahui [nation],’ says 
Andre Perez, an organizer with Movement 
for Aloha No ka WAina, one of the groups 
that sent protesters to Mauna Kea. How- 
ever, OHA Trustee Peter Apo said during the 
meeting that maintaining a more neutral po- 
sition would allow OHA to retain a voice in 
the project’s future. 

The TMT dispute has divided Native Ha- 
waiians, who constitute about 10% of the 
state’s population. The TMT has pledged to 
invest $3 million per year in the local com- 
munity, says Paul Coleman, an astronomer 
at the University of Hawaii, Manoa. It will 
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pay $1 million to the quasigovernmental Of- 
fice of Mauna Kea Management and OHA 
to lease the site (existing telescopes pay $1) 
and supply another $1 million each to sup- 
port science, technology, engineering, and 
mathematics (STEM) education and to train 
Hawaii residents to work on the TMT and 
the other telescopes on Mauna Kea. “This 
is a fantastic opportunity to give our kids a 
chance for the jobs that come along with the 
telescopes,” says Coleman, a Native Hawai- 
ian. The STEM money has already begun to 
flow, he notes. 

At the OHA meeting, Mailani Neal, an 
18-year-old high school student from the 
Big Island, testified that it was her dream 
to study physics and become an astronomer. 
“The stars brought our Hawaiian ancestors 
to Hawaii, and now we have the opportunity 
to bring the people of Hawaii to the stars,” 
she said, fighting back tears. “The Thirty 
Meter Telescope has provided a beacon for 
me and motivated me in all that I do.” 

But $3 million per year is meager com- 
pensation for the expropriation Hawaiians 
have endured, says Williamson Chang, a law 
professor at the University of Hawaii, Manoa, 
who opposes the TMT. The real issues are 
sovereignty and social justice, Chang says, 
noting that Native Hawaiians often end up 
on the bottom of the economic heap in their 
own land. “Even if you go to the university, 
you end up working in a restaurant,” he says. 
“You work two or three jobs and you can’t 
afford to raise a family.” 

Hawaii’s governor, David Ige, has called for 
dialogue among the various parties. In the 
meantime, TMT officials have set no date for 
starting construction and have no plans to 
push for the removal of the protesters, says 
Michael Bolte, an astronomer at the Uni- 
versity of California, Santa Cruz, and associ- 
ate director of the project. For now, he says, 
theyre encouraging Native Hawaiians who 
support the project to speak up. “They're our 
strongest allies.” 

As for the protesters, Kahookahi Kanuha, 
a leading organizer, says they are expanding 
their focus beyond the TMT to the state’s 
management of the whole mountain, push- 
ing to ensure compliance with environ- 
mental requirements and completion of a 
required cultural impact assessment. Protest 
groups will also oppose a planned 65-year 
extension on the University of Hawaii’s lease 
of the summit area, which is set to expire 
in 2033—just 11 years into TMT’s planned 
decades-long lifetime. Stopping the exten- 
sion “would put the TMT in a very tough 
position,’ Kanuha says, “and they would be 
forced to look for a new location.” & 


Ilima Loomis is a freelance journalist in 
Maui, Hawaii. 
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Deep-ocean microbe is closest 
living relative of complex cells 


Genomic study of “Loki” supports a revisionist view 


of the origin of eukaryotes 


By Mitch Leslie 


t’s one of the most significant, and most 
vexing, splits in life’s history. About 
2 billion years ago, the prokaryotes, rel- 
atively simple single-celled organisms 
that include bacteria and archaea, gave 
rise to the more elaborate eukaryotes, 
the lineage that ultimately spawned multi- 
cellular life forms such as fungi, plants, 
and animals like us. Now, researchers 
combing through muck from the bottom of 
the North Atlantic Ocean have identified 
an archaeon that is the closest living rela- 
tive of eukaryotes so far discovered. 
The microbe, informally dubbed Loki 
and described this week in Nature, has set 


Mitochondria and chloroplasts, research- 
ers agree, are descended from formerly free- 
living prokaryotes that took up residence in 
other ancient cells. But the identity of the 
organism that captured and tamed those 
microbes remains unclear. Molecular evi- 
dence suggests that archaea are the closest 
relatives of eukaryotes. Researchers have 
disagreed, however, about whether eukary- 
otes branched off from a simpler prokaryote 
before archaea emerged—the traditional 
three-domain view of life—or evolved later, 
directly from archaea (see diagram). 

Loki, known only from its gene frag- 
ments, supports the second hypothesis. 
As part of a project to uncover novel types 
of archaea, evolutionary biologist Thijs 


Allin the family 


Anewly discovered deep-sea microbe named Loki suggests eukaryotes evolved from 
archaea (right) rather than representing a separate branch of life (left). 


Bacteria Eukaryotes Archaea 


off a buzz among evolutionary biologists. 
“Tt tells us something very important about 
the origin of eukaryotes,’ says Eugene 
Koonin of the National Center for Biotech- 
nology Information in Bethesda, Maryland. 
“The ancestor of eukaryotes was a highly 
complex organism related to other ar- 
chaea.” The deep-sea microbe “looks like a 
potential transitional form” that preserves 
one of the evolutionary steps between ar- 
chaea and eukaryotes, adds evolutionary 
cell biologist Mark Field of the University 
of Dundee in the United Kingdom. 

Unlike prokaryotes, eukaryotes sport 
organelles such as power-generating mito- 
chondria and—in plants and some protists— 
light-capturing chloroplasts. Moreover, they 
stow DNA inside a nucleus that’s enclosed 
by a membrane, and their cells feature other 
internal structures built of membranes, such 
as the Golgi apparatus, lysosomes, and the 
endoplasmic reticulum. 
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Bacteria Leki 


Eukaryotes 


Archaea 


Ettema of Uppsala University in Sweden 
and colleagues analyzed 10 grams of sedi- 
ment brought up from the ocean bottom 
midway between Greenland and Norway. 
They found that it contained distinctive 
gene sequences that indicated the presence 
of unidentified microorganisms. Although 
the researchers could separate out only a 
few nanograms of shattered DNA, they 
were able to use a technique called metage- 
nomics to piece together these fragments 
and produce partial genomes for three new 
types of archaea. 

The researchers weren’t able to isolate 
any living or dead microbes from the muck, 
but they could infer the characteristics of 
the one with the most complete genome. 
(Because the sediment sample came from 
near the undersea volcanic vents known 
as Loki’s Castle, the researchers dubbed 
this organism Lokiarchaeum, or Loki for 
short.) Its genome reveals several telling 
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similarities with eukaryotes, the research- 
ers say. For example, actin proteins help 
form the cytoskeleton that braces eukary- 
otic cells and enables them to move. Loki 
carries actinlike genes that are more simi- 
lar to the eukaryotic versions than are the 
varieties found in other archaea. 

In eukaryotic cells, enzymes known as 
small GTPases perform a slew of functions, 
from shaping the cytoskeleton to orchestrat- 
ing the transportation of material in tiny 
membrane containers called vesicles. Genes 
for a handful of these enzymes have turned 
up in bacteria and in other archaea, but “we 
found 60 to 70 of those guys in Loki,’ Ettema 
says. Unlike other prokaryotes, Loki also 
harbors several genes for parts of the ESCRT 
protein complex that in eukaryotes bends 
and snips membranes, abilities important 
for cell division and for forming vesicles that 
ferry molecular junk for disposal. 

“This is the most eukaryotelike prokary- 
ote we’ve ever seen,” says evolutionary bi- 
ologist James McInerney of the National 
University of Ireland, Maynooth. An evo- 
lutionary tree based on the genomes of all 
three new archaea suggests that they are 
the closest known relatives of eukaryotes. 
To McInerney, the result “is another stake 
through the heart of the three-domain tree.” 

Loki’s molecular features suggest that 
eukaryotes’ ancestor could have had an ac- 
tin cytoskeleton and might have been able 
to gobble up cells or other types of food. 


“Tt tells us something very 
important about the origin 
of eukaryotes.” 


Eugene Koonin, National Center 
for Biotechnology Information 


It might also have had the beginnings of 
internal structure, with a transport system 
involving vesicles. However, it likely lacked 
some signature features of eukaryotes, in- 
cluding a nucleus and mitochondria. 

Not everyone is persuaded that Loki 
bridges the gap between eukaryotes and 
prokaryotes. The claim is based on too few 
genes, says evolutionary biologist William 
Martin of Heinrich Heine University Diis- 
seldorf in Germany. “Six proteins, does that 
bridge the gap? Not in my book,” he says. 

If researchers could pinpoint some of 
Loki’s cells, they might be able to strengthen 
the case by confirming that it really has 
the features hinted at by its genes. The 
hunt is on, Ettema says. “We are currently 
looking very hard to get these cells out 
of sediments.” & 
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BIOMEDICAL RESEARCH 


Japan's ‘NIE’ starts with modest 


funding but high ambitions 


Key priority is filling nation’s sparse drug pipeline 


By Dennis Normile 


apan’s biomedical community has 
long envied the funding and political 
clout of the U.S. National Institutes of 
Health (NIH). Now, after 2 years of 
planning, the country has its own ver- 
sion in the Japan Agency for Medical 
Research and Development (AMED). 

Officially launched last month, AMED 
has a wide-ranging mandate to smooth the 
flow of basic discover- 
ies to the clinic and 
the market. But unlike 
NIH, AMED does not 
have its own campus 
or facilities. The fledg- 
ling agency's budget— 
roughly $1.2 billion this 
year—is also a fraction 
of NIH’s $30 billion in 
2015, and three differ- 
ent ministries have a 
say in how much of that 
money is spent. Makoto 
Suematsu, AMED’s first 
president, says he aims 
to increase the portion 
of the budget under his direct control and 
improve cooperation between the minis- 
tries. “The moment of truth will come in the 
next 1 or 2 years,” he says. 

In spring 2013, Prime Minister Shinzo Abe 
made biomedical reform a central plank of 
his economic growth strategy. He noted that 
despite success in cutting-edge research, Ja- 
pan was not competitive in world markets 
for drugs and medical devices. Driving that 
point home, a 2013 paper in the journal 
Joho Kanri reported that in 2012, Japanese 
pharmaceutical companies had 780 small 
molecule drug candidates in the clinical 
pipeline while U.S. competitors had 4110. 

Abe and others suspected that bureau- 
cracy was at least partly to blame for a 
feeble drug pipeline. Separate ministries 
oversee each of the three activities criti- 
cal to developing new treatments: basic 
research, health care delivery, and com- 
mercialization. As a result of this bal- 
kanization, for example, in the mid-2000s 
the education ministry (in charge of ba- 
sic research) purchased high-throughput 
genome sequencers under a scheme to 
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AMED’s “moment of truth,” Suematsu says, 
will come in the next 1 or 2 years. 


promote personalized medicine. But regu- 
lations precluded the machines from being 
used for cancer research at hospitals, which 
are overseen by the health ministry. Simi- 
lar barriers block collaborations in regen- 
erative medicine, robotics, and other work. 
Suematsu, a medical doctor and biochem- 
ist, says that one of his primary objectives is 
“to tackle these barriers one by one.” 

When first announced, Abe’s “Japan NIH” 
plan got a rocky reception from academics. 
Worried that an em- 
phasis on applications 
would come at the ex- 
pense of basic research, 
the heads of 52 scientific 
societies in June 2013 
signed an “Emergency 
Declaration” expressing 
“strong concerns about 
the distribution of re- 
sources and the process 
of training researchers.” 
Those concerns have 
largely evaporated, as 
the largest source of 
grants to academics 
(about $2.5 billion in 
2014) remains independent of AMED. 

AMED is charged with advancing work 
in fields ranging from infectious diseases to 
cancer and brain science to rare maladies 
and regenerative medicine. The agency will 
start to stake out its own territory later this 
year, Suematsu says, when it begins its own 
grantmaking. It will start small, with just 
$170 million in funding up for grabs. 

Grants won’t be enough to fill the treat- 
ments pipeline, however. In the United 
States, small and venture businesses play 
a key role in shepherding new pharma- 
ceuticals though preclinical work and tri- 
als. But Japan is weak in mechanisms to 
license academic discoveries and _ lacks 
“systems which support small and medium 
enterprises and [startup] ventures,” says 
Yoshiyuki Osabe, an intellectual property 
specialist at the Japan Patent Office in Tokyo. 

“AMED can’t do it alone,” agrees Ryuichi 
Morishita, a gene-therapy scientist at 
Osaka University. Other initiatives in the 
works, he says, will address gaps that must 
be filled to make Japan’s medical industry 
an engine of economic growth. & 
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When modern birds took flight 


Feathered fossils from China show that modern birds could 
fly and wade earlier than expected 


By Michael Balter 


ut yourself on the planet 130 million 

years ago. Most of the animals, from 

horned dinosaurs to swimming ple- 

siosaurs, would be deeply alien, not 

to say terrifying. But rising from the 

wetlands and winging across the sky 
were birds startlingly like today’s. That’s 
the message from two bird skeletons— 
spectacularly preserved with feathers and 
all—reported this week. 

The 130-million-year-old fossils of wad- 
ing birds, found in northeast China by a 
team led by paleontologists Min Wang and 
Zhonghe Zhou of the Institute of Verte- 
brate Paleontology and Paleoanthropology 
in Beijing, push back the lineage that led 
to today’s birds by at least 5 million years 
and make it almost certain that its origin 
was much older still. The fossils’ special- 
ized anatomy suggests that key factors in 
birds’ long-term success, such as expert 
flying ability and rapid growth rates, arose 
surprisingly early in avian evolution. 

“New bird fossils seem to come 
out every week now, and they 
are revolutionizing our under- 
standing of bird evolution. But 
of all the new specimens, this is 
one of the most important found 
over the last decade,” says paleon- 
tologist Stephen Brusatte of the 
University of Edinburgh in the 
United Kingdom. Not only does 
the bird look nearly modern, but 
it was also apparently a water 
dweller, showing that “ancient 
birds became specialized in their 
respective habits” very early, says 
paleontologist Luis Chiappe of 
the Natural History Museum of 
Los Angeles County in California. 

Birds were born about 
150 million years ago, when a 
group of small meat-eating dino- 
saurs spread their feathered wings 
and took to the skies. They soon 
split into two distinct groups: the 
lineage that led to modern birds, 
called the ornithuromorphs, and 
the so-called opposite birds, or 
enantiornithines, whose shoulder 
ball-and-socket joints connected 
in an inverse way from those of 
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This 130-million-year-old bird, the oldest known member of the | 
to modern birds, had feathers suited for skillful flight. 


living birds. Relatively poor fliers, the op- 
posite birds also typically had teeth and 
clawed wings. They thrived for millions of 
years, but vanished along with their dino- 
saur relatives in the mass extinction at the 
end of the Cretaceous. 

Meanwhile, the lineage of modern birds 
evolved “huge chest muscles and wings 
comprised of many different types of feath- 
ers layered over each other’—features es- 
sential to high-powered flight, Brusatte 
explains. Their bone structure also sug- 
gests that they grew much faster than the 
opposite birds. But researchers did not 
know when those features emerged. They 
had found some excellent specimens of the 
earliest birds, such as Germany’s famed 
150-million-year-old Archaeopteryx, as well 
as stunning later fossils from northeastern 
China (Science, 15 March 2013, p. 1261). But 
a yawning gap remained between Archae- 
opteryx and other fossils, most of which 
were opposite birds, Chiappe says. 

The new fossils help fill the gap in time 
and also in anatomy. Writing in Nature 
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Communications, the Beijing team de- 
scribes two skeletons of a species they call 
Archaeornithura meemannae. (Archaeor- 
nithura means “ancient ornithuromorph,” 
and meemannae is in honor of Chinese 
paleontologist Meemann Chang.) Each ex- 
quisitely preserved specimen has the tell- 
tale traits of a modern bird: fan-shaped tail 
feathers, highly fused bones at the ends 
of the wings, and the U-shaped wishbone 
familiar to anyone who has carved a roast 
chicken. The fossils even have a small pro- 
jection on the front edge of their wings— 
known to boost maneuverability during 
flight—that is remarkably similar to that of 
today’s birds. 

Furthermore, Archaeornithura had long 
legs and feet apparently adapted to wad- 
ing in water, similar to those of today’s plo- 
vers, suggesting that modern birds arose 
in aquatic habitats. Finding such a mod- 
ern bird, already specialized for wading, 
suggests that millions of years of aquatic 
evolution took place even before A. mee- 
mannae came on the scene, Zhou explains. 
He suggests that while the opposite birds 
found safety in the trees, life in more open, 
aquatic spaces may have given the ornithu- 
romorphs “more choices of high protein 
food” and favored their evolution into 
swift fliers so they could avoid the danger 
of predators. All this evolution must have 
happened after Archaeopteryx but before 
130 million years ago. 

The fossils reveal the origins of 
the features that, tens of millions 
of years later, may have allowed 
modern birds to survive the Cre- 
taceous extinction when other 
birds did not, Zhou and Wang say. 
No one is sure just what condi- 
tions prevailed in the postasteroid 
apocalypse, but Wang speculates 
that the fast growth rates of mod- 
ern birds, which let them reach 
adulthood faster and spend less 
time dependent on their parents, 
may have given them an advan- 
tage. Skilled flight may also have 
been a boon. 

But this idea is “too simplis- 
tic,” counters Sankar Chatterjee, 
a paleontologist at Texas Tech 
University in Lubbock, because 
many ornithuromorphs vanished 
during the extinction, too. Never- 
theless, Brusatte argues, “from 
the ashes of that extinction, a 
few groups of more sophisticated 
birds, with better flight abilities 
and perhaps faster growth rates, 
were able to survive.” Those key 
traits, the new fossils show, arose 
near the dawn of bird evolution. & 
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GENOMICS 


New database links regulatory 
DNA to its target genes 


Recent big genomics projects aid disease studies 


By Elizabeth Pennisi 


cientists have known for years that 

the genome is much more than a set 

of codes for proteins. It is also a giant 

switchboard, riddled with sequences 

that control gene activity. This extra 

layer of complexity has hampered 
searches for the genetic basis of diseases and 
for drugs that would target just the DNA at 
fault. In the past few months, however, sev- 
eral major research consortia have delivered 
what amount to user’s manuals for the ge- 
nome, mapping the locations of thousands of 
those switches, the specific genes they con- 
trol, and where in the body they are turned 
on or off. 

On pages 648, 660, and 666, the latest and 
arguably boldest of these big biology efforts 
reports preliminary results. By analyzing ge- 
netic material gleaned from more than 
100 people who had died just hours 
before, the Genotype-Tissue Expres- 
sion (GTEx) project catches gene regu- 
lation in action, identifying the genes 
switched on or off by subtle changes in 
DNA within 2 million bases of any gene. 
By evaluating multiple tissues from 
each body, it also charts the reach of those 
regulatory sequences across cell types—some 
affect a gene in all tissues; others are influen- 
tial in a few tissues or just one. 

Three efforts reported earlier took other 
approaches to mapping the genome’s many 
switches. Two, called BLUEPRINT and the 
NIH Roadmap Epigenomics Project, chased 
down the locations of DNA and its associ- 
ated proteins that are the target of chemical 
modifications called epigenetic marks, which 
determine whether a gene can be activated. 
A third, the latest iteration of a 20-year effort 
called FANTOM (Functional ANnoTation Of 
the Mammalian genome), provides an exten- 
sive catalog of the beginnings of genes and of 
their control sequences. Thanks to these four 
efforts, “we are on the cusp of learning a lot 
more about genome function,” says Jonathan 
Pritchard, a geneticist at Stanford University 
in Palo Alto, California. 

Not everyone is persuaded that these 
massive data-gathering efforts offer much 
practical help to biologists. “I am not a fan of 
big science,” says Dan Graur, an evolution- 
ary geneticist at the University of Houston 
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in Texas. Simon Xi, a computational biolo- 
gist in Cambridge, Massachusetts, who is 
using the GTEx data in his work on drug 
development, believes the databases are vi- 
tal, however, but says they could be more 
user-friendly: “The challenge for us is how 
to integrate all those data.” 

The new work aims to address an ongo- 
ing source of frustration among disease re- 
searchers. A decade ago, geneticists set out 
to link specific DNA sequences to common 
diseases. In so-called genome-wide asso- 
ciation studies (GWAS), massive consortia 
pooled tens of thousands of patients and 
came up with thousands of subtle genetic 
changes, called single nucleotide polymor- 
phisms (SNPs), which appeared to increase 
the risk of inflammatory bowel disease, 
schizophrenia, autism, and a whole host of 
other common disorders. Puzzlingly, many 


“We are on the cusp of learning a 


lot more about genome function.” 
Jonathan Pritchard, Stanford University 


of these changes occurred outside genes 
(Science, 27 May 2011, p. 1031). The muta- 
tions presumably affected gene expres- 
sion. But how? The impasse “suggested we 
needed to get moving on understanding 
regulatory variation,” recalls Nancy Cox, a 
quantitative human geneticist at Vanderbilt 
University in Nashville. 

FANTOMS, a $100 million effort led by 
the RIKEN institute in Japan, has provided 
part of the answer by mapping two kinds 
of regulatory sequences in the genome: 
“promoters” that help kick off transcription 
and are located at the start of a gene, and 
“enhancers,” regulatory DNA that can be 
far from the genes they act on. The proj- 
ect developed technology to capture RNA 
right as it starts to form off the DNA, which 
pinpoints promoters. It catalogs enhancers 
as well, because these pieces of regulatory 
DNA are also transcribed into RNA. Led by 
RIKEN’s Yoshihide Hayashizaki, FANTOM5 
surveyed RNA in every major human organ, 
hundreds of cancer cell lines, more than 
200 purified primary cell types, and in cells 
at various stages of differentiation. 
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Earlier this year, the team described 
201,000 human promoters and 65,000 
human enhancers, showing that genes 
often contained several promoters that 
were activated differentially in various 
tissues (Science, 27 February, p. 1010). 
The effort is “absolutely unprecedented,” 
says Bing Ren, a molecular geneticist at the 
San Diego, California, branch of the Ludwig 
Institute for Cancer Research. “This is really 
a significant resource.” 

The $300 million NIH Roadmap Epi- 
genomics project took a different approach 
to identifying enhancers. It mapped the 
epigenetic changes that associate with en- 
hancers. For each cell type studied, assays 
of chemical marks called methylation and 
other changes in the DNA-protein matrix 
called chromatin helped pinpoint enhanc- 
ers. Based on their sequences, investigators 
were also able to identify the proteins that 
help those enhancers turn on genes. The 
study, reported on 19 February in Nature, 
included 127 reference epigenomes—all the 
epigenetic marks on a genome—for various 
embryonic and adult tissues and cell types, 
including immune, brain, heart, muscle, gut, 
fat, and skin cells. 

The European Union’s €30 million BLUE- 
PRINT project took an even deeper look 
into epigenomes, focusing on white 
and red blood cells. It determined the 
epigenomes of the primary blood stem 
cells and of those cells at various stages 
in their differentiation into mature 
white or red cells. Among other goals, 
BLUEPRINT is looking for differences 
between these cellular epigenomes in 
healthy individuals and people with leuke- 
mia, whose blood cells proliferate uncon- 
trollably. “BLUEPRINT is going to show, 
together with Roadmap, the rules of regula- 
tion of gene expression,” says BLUEPRINT’s 
Willem Ouwehand, an experimental hema- 
tologist at the University of Cambridge in 
the United Kingdom. 

Once a GWAS identifies a SNP, data from 
Roadmap, BLUEPRINT, or FANTOM can 
provide further evidence that it might influ- 
ence health by showing whether the varia- 
tion falls in a regulatory region. GTEx goes a 
step further: It pins down how genetic varia- 
tion, particularly in noncoding DNA, affects 
a gene's activity across different parts of the 
body. To measure that gene activity, the proj- 
ect pulls out the transcribed RNA of tissues 
in multiple individuals. The researchers can 
then correlate changes in levels of specific 
RNA transcripts—an indicator that specific 
genes are active—with SNPs or other DNA se- 
quence variations. “It will help the research- 
ers narrow down the discoveries they have 
made in GWAS studies,” says Simona Volpi, 
a pharmacologist at the National Human 
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NIH Roadmap Epigenomics 
Project has cataloged chemical 
modifications to the genome, 
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how accessible genes are for 
activation. It looked at 127 cell 
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ENCODE carried out bio- 
chemical assays on hundreds 
of mouse and human cell 
types to identify elements of 
the genome that may play a 
role in gene expression. 
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Genome Research Institute based in Rock- 
ville, Maryland, who helps coordinate GTEx. 

Because the researchers needed multiple 
tissue samples from internal organs—too 
many to collect from living people—they 
turned to recently deceased people whose 
kin donated their bodies for research. The 
ultimate goal of the $100 million NIH- 
funded project is to collect and analyze about 
25,000 tissues from 900 individuals; the data 
published so far include RNA from up to 
43 tissue sites from 175 people. With those 
samples, “GTEx has the power to compare 
tissues within individuals and across individ- 
uals,” Volpi adds. “This is something that no- 
body has been able to do before at this scale.” 

Xi, who is tracking down drug targets for 
depression, schizophrenia, and Alzheimer’s 
and Parkinson’s diseases, is already turning 
to GTEx data to follow up on SNPs previously 
implicated in those brain disorders. “Having 
GTEx data helps connect the dots,’ he says. 
The data also enable him to check whether 
DNA sequences implicated by GWAS are only 
active in the brain. That could make them es- 
pecially promising drug targets, reducing the 
risk of broad side effects. 

Graur, who made a name for himself with 
his withering criticisms of a predecessor ge- 
nomics project called ENCODE, distrusts the 
GTEx results because the project relied on 
postmortem samples. RNA degrades quickly, 
he notes. “If you want to do [gene] expres- 
sion, you have to have live organisms,” he 
says. GTEx counters that it has validated that 
samples taken within 6 hours of death faith- 
fully reflect natural gene activity. 

Still, GTEx and the other genomic 
projects have shortcomings. They don’t 
comprehensively cover all tissues, disap- 
pointing some eager for such data on their 
favorite cell type. Diabetes researcher Mark 
McCarthy of the University of Oxford in the 
United Kingdom, for example, is doing his 
own mini Roadmap and GTEx projects on 
pancreatic islet cells, which were omitted 
from the large-scale efforts. And tapping 
these genomic databases can also be chal- 
lenging. “I see people going to the sites and 
struggling,” says Chris Tyler-Smith, an evo- 
lutionary geneticist at the Wellcome Trust 
Sanger Institute in Hinxton, U.K. 

Nonetheless, Tyler-Smith welcomes GTEx 
and the other genomic projects, saying they 
represent “getting together in large groups 
to do things that one could hardly do in 
one’s lab.” And for people like McCarthy, 
there is newfound hope of unraveling the 
complex genomic networks that underlie 
diabetes and other diseases. “I’m more op- 
timistic than 4 to 5 years ago when it really 
wasn’t clear how we were going to deal with 

these regulatory signals,” he says. “We now 
have quite a few a clues.” & 
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FOOD SECURITY 


Italy’s olives under siege 


Blight alarms officials across Europe 


By Erik Stokstad 


ll across the heel of Italy’s boot, 
centuries-old olive trees are falling 
victim to a devastating pathogen. 
“It’s beyond belief,’ says Brion Duffy, 
a plant bacteriologist at the Zurich 
University of Applied Sciences in 
Switzerland, who has visited the Salento 
peninsula and seen hectare upon hectare of 
desiccated, dying trees. “This is the center 
of olive production in Italy, Europe, and the 
world, and it’s like a bomb went off.” 

In spite of control efforts, the disease is 
spreading north, threatening olive groves 
throughout Italy and beyond and ring- 
ing alarms across Europe. In January, the 
European Food Safety Authority 
(EFSA) warned of yield losses and 
rising costs from control measures. 
Italy declared its first national 
emergency for a plant disease 
and appointed a special commis- 
sioner with new powers. And last 
month, France closed its border to 
all susceptible plant varieties from 
Salento, sparking accusations of 
a trade war. “It’s a quite major is- 
sue for Europe,” says Jean-Claude 
Grégoire, an entomologist with the 
Université libre de Bruxelles. 

The culprit is Xylella fastidi- 
osa, a bacterium widespread in 
the Americas that has been on Eu- 
rope’s quarantine list since 1981. 
One strain appears to cause mild 
symptoms in olive groves in Cali- 
fornia but does not kill trees. Other 
subspecies have wreaked havoc in 
South American citrus groves and 
North American vineyards. The microbe 
kills plants by multiplying inside vascular 
tissue, thereby slowly clogging their wa- 
ter transport system. Many kinds of sap- 
sucking insects transmit it, even from 
healthy-looking plants—or from hundreds 
of mildly susceptible host species—making 
it extremely hard to contain. 

X. fastidiosa’s assault on Italy was un- 
covered by Donato Boscia, a plant virologist 
with the Italian National Research Council’s 
Institute for Sustainable Plant Protection in 
Bari. He first observed unusual symptoms 
in his father-in-law’s olive grove in the 
summer of 2013. Desiccated leaves clung 
to branches for several months, instead of 
falling off. Other farmers had noticed the 
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problem, too. That October, Boscia and his 
colleagues confirmed the culprit was X. fas- 
tidiosa and also found it in almond trees 
and oleander growing near infected groves 
in the province of Lecce (see map). The Eu- 
ropean and Mediterranean Plant Protection 
Organization was quickly alerted. Within 
weeks, Italy banned the movement of olive 
saplings and other susceptible plant mate- 
rial from Lecce. 

But the disease spread. By December 
2013, some 8000 hectares were in trouble. 
Often entire orchards fell sick, with the larg- 
est and oldest trees hit hardest. “People are 
observing the decline of trees that were cul- 
tivated by their parents and grandparents,” 
Boscia says. Two months later, the European 


Bringing a pathogen to heel 
Italy hopes that an anti-Xylella cordon will save olive trees 
north of Lecce province. 
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Union banned transport of most plants 
from the affected region, which has hurt 
commercial nurseries, and required mem- 
ber nations to start testing for X. fastidiosa. 

Boscia and colleagues raced to figure 
out what was going on. Genetic markers 
match those of a strain isolated from orna- 
mental coffee plants and oleander exported 
from Costa Rica to Europe. Infected plants 
may have arrived in Italy while they were 
asymptomatic. In any event, European in- 
spections and testing have been too lax, 
says Giovanni Martelli, a plant virologist 
with the University of Bari Aldo Moro. 

In Italy, the primary vector is the spittle- 
bug Philaenus spumarius, Boscia and col- 
leagues reported in August 2014 in the 
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Journal of Economic Entomology. These 
insects are common across Europe and 
abundant in olive groves. In summer, 
adults fly up from the grass into the trees, 
feeding on sap and repeatedly infecting 
each tree. Last August, tests revealed that 
up to 80% of the insects in afflicted or- 
chards were carrying the bacterium. “It’s 
terrifying,” Martelli says. “It’s an army of 
insects loaded with bullets.” 

By October 2014, X. fastidiosa had spread 
to an estimated 23,000 hectares. This past 
February, the agriculture ministry tripled 
its funding, to €7.6 million, for combating 
the blight. (The regional government has 
allocated another €5.9 million.) To pre- 
vent X. fastidiosa from spreading north, 
authorities designated a 3-kilometer-wide 
eradication strip and buffer zone span- 
ning the peninsula; workers are clearing 
it of sick trees and host plants like ole- 
ander and will spray insecticides to kill 
adult spittlebugs. Elsewhere, farmers have 
plowed thousands of hectares of orchards 
to kill juvenile spittlebugs. 

Some environmental groups 
doubt that X. fastidiosa is the 
cause of the declines. They sus- 
pect endemic fungi or leopard 
moths, which have plagued olives 
in the past. (Last month, EFSA 
issued a report dismissing this 
theory.) The groups have rallied 
protesters to block efforts to cull 
sick olive trees. In February, a spe- 
cial commissioner was empow- 
ered to enforce tree removal, and 
police have been called in to keep 
protesters away from chainsaw- 
wielding forestry workers. 

Yet the disease continues to ad- 
vance. Last month, infected trees 
were discovered about 30 kilome- 
ters north of the containment zone. 
Quick action might eliminate this 
new hotspot, Martelli says. And 
more measures are in the works. 
An E.U. advisory committee last week rec- 
ommended that the European Commission 
ban imports of ornamental coffee plants 
from Costa Rica and Honduras. It also 
specified that countries should squelch new 
outbreaks by removing infected plants and 
all host plants within 100 meters. 

At ground zero, the epidemic has taken a 
grim toll, says Blanca Landa, a plant pathol- 
ogist with the Spanish National Research 
Council’s Institute for Sustainable Agricul- 
ture in Cordoba who visited Lecce in No- 
vember. “I saw a farmer crying in his field,” 
she says. “He told me, ‘I’m taking care of my 
olives better than my own children’ ” Now, 
his orchard is dying. “I came back feeling 
completely demolished,” Landa says. ™ 
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Ht PILLS 


Raymond Schinazi’s handful of 
lifesaving drugs has earned him riches, 
esteem, and a dose of enmity 
By Jon Cohen 


n the deck of a sportfishing boat 
on a sparkling Saturday morn- 
ing in Mexico’s Gulf of California, 
Raymond Schinazi is singing. 


Sky of blue 
Sea of green 
In my yellow submarine 


The fish aren’t biting, but this 65-year-old 
organic chemist from Emory University in 
Atlanta is delighted to watch the humpback 
whales spout, soak up the Baja sun, and be 
out of cell tower range. “There is lots of sea, 
and there are lots of fish,’ he says after a few 
hours pass without landing a big one. “You 
have to be an optimist.” 

The name of the boat is No Big Deal. 

Few scientists who work in academia 
have benefited more from big deals than 
Schinazi. He specializes in the synthesis of 
so-called nucleoside and nucleotide drugs 
that interfere with viral replication—an idea 
once widely dismissed. Two years ago, Schin- 
azi pocketed $440 million on a deal involv- 
ing sofosbuvir, a yellow submarine-shaped 
pill that has revolutionized hepatitis C 
(HCV) treatment and had blockbuster sales 
for Gilead Sciences of Foster City, California. 
In 2005, an anti-HIV drug he helped develop, 
FTC, netted $540 million for Emory—which 
gave $71 million to Schinazi. Emory and 
Schinazi have made millions more in royalty 
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fees for a sister drug, 3TC, which treats both 
HIV and hepatitis B. And Schinazi, who also 
runs a lab at Atlanta’s VA Medical Center, had 
a hand in two other drugs on the market. He 
holds more than 90 patents and recently 
helped form a company that he contends will 
make still more effective drugs against hepa- 
titis B and C. But on his way up, Schinazi has 
rocked a few boats. 

A bear-sized man who speaks bluntly, ne- 
gotiates fiercely, and favors splashy, multi- 
colored shirts, Schinazi is a polarizing force. 
Friends and colleagues call him a genius, a 
superstar, a hero. James Curran, who heads 
the Rollins School of Public Health at Em- 
ory, offers quieter praise. “He’s a very, very 
solid academic researcher who is out-of-the- 
world entrepreneurial.” 

Others wince at what they see as his abra- 
sive style and charge that he receives more 
profit and scientific credit than he deserves. 
His former business partner Jean-Pierre 
Sommadossi, who heads Atea Pharma- 
ceuticals in Cambridge, Massachusetts, says 
Schinazi “wasn’t involved with anything” 
in the actual development of the HCV drug 
sofosbuvir, other than starting the company 
that invented it. The two no longer speak. 
Medicinal chemist David Chu of the Uni- 
versity of Georgia, Athens, who published 
more than 100 papers with Schinazi, cut 
ties, too. Protracted lawsuits have challenged 
Schinazi’s patents and financial decisions. He 
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Organic chemist Raymond Schinazi 

is an unusual academic-entrepreneur 
hybrid who has made a fortune on drugs 
against HIV and hepatitis B and C. 
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also suffered collateral damage when Gilead 
debuted sofosbuvir at $1000 per pill, which 
one news article pilloried as a “poster child 
for pharmaceutical greed.” 

Schinazi, who lives mostly in Atlanta, 
makes no apologies for his wealth, which al- 
lows him to enjoy a plush ocean-view condo 
in Baja’s San Jose del Cabo (I stayed in a 
spare bedroom because of its remote location 
and our tight schedule), a third home in Mi- 
ami, and the occasional flight on a private jet 
service. “I save lives,’ he tells me as we wait 
for a fish to strike. “If I make money along 
the way, what’s wrong with that?” Schinazi 
sees a common motivation driving his crit- 
ics. “When youre very successful,” he says, 
“people are very jealous.” 


THE FRIDAY EVENING BEFORE the fishing 
expedition, I join Schinazi on the 30-minute 
drive from his secluded condo into the rowdy 
party town of Cabo San Lucas. But we do not 
stop at the Cabo Wabo Cantina, the Jungle 
Bar, or the Husband Day Care Center. We at- 
tend synagogue. 

The service takes place in the apartment 
of a Chabad rabbi who wears a traditional 
black hat and a 17th century black frock 
coat. We sit with 20 or so men, expats from 
all over the world, dressed in T-shirts, flip- 
flops, shorts, and fedoras. Save for the ark 
holding a Torah and the room divider sepa- 
rating men from women in Orthodox Jew- 
ish tradition, we could be at a poker game. 
When we come to a passage about the Jews’ 
flight from Egypt, Schinazi leans over and 
says, “I lived it. A real exodus.” 

Schinazi was born in 1950 in Alexandria, 
Egypt, where his father had an import/export 
business, a glue factory, and made sheep cas- 
ing for sausages. Raymond attended a pri- 
vate British school. “When he was around 9, 
10 years old I asked him what he wanted to 
do,’ says childhood friend Yves Salama. “He 
said, ‘I’m going to be a scientist and I’m go- 
ing to cure cancer’ I was quite impressed he 
knew there was such a thing as cancer.” 

Salama says he and Schinazi “were free- 
range children” who had great fun shooting 
clay pots with their BB guns and making 
tinfoil bombs with gunpowder from shotgun 
shells. But Schinazi’s idyllic childhood ended 
abruptly in 1962 when the family returned 
from vacation to find their home sealed, 
bank accounts frozen, and cars gone. It 
was part of President Gamal Abdel Nasser’s 
socialist agenda and his push to oust Jews. 
“Everything was taken from us,’ says Schin- 
azi, whose family moved into a hotel. 

By 1964, the Schinazis were broke and 
took a ship to Naples, Italy, where they lived 
in a refugee camp for a time. An elite board- 
ing school in England for Jewish children 
eventually accepted Raymond. From there he 
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went to England’s University of Bath, where 
he completed an undergraduate degree and 
a Ph.D. in chemistry. He lived on 100 English 
pounds a month, working as a parking at- 
tendant to help pay his way, and didn’t have 
real money in his pocket until he won 3000 
pounds in the Spanish lottery. 

While at Bath, Schinazi met his mother’s 
brother, pediatrician André Nahmias, who left 
Egypt before his nephew was born and had 
become a well-known herpesvirus researcher 
at Emory. Nahmias helped Schinazi secure a 
postdoc at Yale with William Prusoff, a chem- 
ist dubbed “the father of antiviral chemo- 
therapy.” Prusoff synthesized the first anti- 
viral used in clinical practice, a drug called 
idoxuridine that treats herpes infection of 
the eye. It is a synthetic nucleoside. 

Nucleosides are chemical precursors to 
the nucleotides that make up DNA and 
RNA. When viruses copy them- 
selves, they string together nucleo- 
tides. Chemical modifications can 
create defective analogs of these 
building blocks. Viruses incorpo- 
rate them into a growing chain of 
nucleotides but usually cannot add 
new links, gumming up production 
of genetic material and aborting vi- 
ral replication. 

Nucleotide analogs typically make 
lousy drugs: The molecules are highly 
charged and have difficulty entering 
cells. Not so for nucleoside analogs, 
which cells convert into nucleotides. 
But they can cause serious toxicities 
if they get into human DNA or RNA. 
At the time Schinazi joined Prusoff’s 
lab in 1976, many drug developers 
shunned them. 

After 3 years with Prusoff making 
nucleoside analogs, Schinazi moved 
to Emory to study viruses and immunol- 
ogy with his uncle. “I knew that the glory 
was in biology,” he says. “I wanted to save 
the world.” But he was confident that his ap- 
titude for chemistry would help. “I look at 
structures and they mean something to me,” 
he says. “It’s a language.” 


BRENDAN LARDER MET SCHINAZI in the 
early 1980s when he exited a herpesvirus 
conference held at the University of Oxford 
and stepped into the street. “A car pulled up 
on the sidewalk, a window came down, and a 
voice said, ‘Get in; ” recalls Larder, who was 
then working on his Ph.D. in virology. Schin- 
azi knew that Larder was studying acyclovir, 
a nucleoside analog being developed by Bur- 
roughs Wellcome to treat herpesvirus, and 
wanted to pick his brain. In those days, Lar- 
der says, Schinazi “was not at all prominent.” 

Larder tells me this story at a swank hotel 
in Key Biscayne, Florida, where 125 research- 
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ers gathered in December 2014 for HIV 
DART, a biennial conference on HIV drug 
development that Schinazi co-organizes. 
(He hosts a similar hepatitis meeting in Ha- 
waii.) Many of the attendees wear the color- 
ful meeting T-shirt, sporting an aerial view 
of Miami that Schinazi shot from his nearby 
condo. “If you ask a lot of people why did you 
come here, they’1l say, ‘Because Ray asked me 
to; ” says Larder, who for years worked at 
Wellcome and now heads the HIV Resistance 
Response Database Initiative, a nonprofit 
based in London. “It’s a way to bring his 
friends together. Ray does polarize opinions 
and people’s emotions, but more people love 
him than hate him.” 

One morning, Schinazi gives the David 
Barry DART Achievement Award. He sings 
the praises of the deceased Barry, who once 
directed research at Wellcome and helped de- 


velop AZT, the first anti-HIV drug. He makes 
no mention of their complicated history. “Da- 
vid Barry was my arch enemy,’ Schinazi tells 
me during a meeting break. 

In 1986, Schinazi—independent of 
Wellcome—synthesized AZT and gave it to 
Sommadossi, a pharmacologist then with 
the University of Alabama, Birmingham. 
Sommadossi proceeded to show that it had 
toxic effects on bone marrow cells. Barry, 
anxious about AZT’s imminent approval 
by the U.S. Food and Drug Administration 
(FDA), dismissed the results, Schinazi says, 
asserting that they were due to impurities 
in the compound he had made. “He was a 
big shot and I was a nothing,’ Schinazi says. 
“He hated me.” The toxicity soon surfaced in 
human studies, but the FDA approved the 
drug nonetheless in March 1987. That same 
year, Schinazi helped Prusoff’s lab show that 
another nucleoside analog, d4T, had activ- 
ity against HIV—and it later would come to 
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market. But it was a third AIDS drug dubbed 
3TC that elevated Schinazi to the big leagues. 

In 1989, he learned of a potential drug 
developed by a Canadian biotech, BioChem 
International. The drug, a synthetic nucleo- 
side, contained an equal mixture of mirror- 
image “enantiomers’—left- and right-hand 
versions of the same molecule. Because 
enantiomers can have different powers 
and toxicities, Schinazi saw an opportu- 
nity. He enlisted Dennis Liotta, a chemist 
at Emory, to see if they could sort out the 
enantiomers. “I was a pretty good chem- 
ist and Ray had a background in virology,” 
Liotta says. “It turned out to be an excel- 
lent combination.” 

With help from one of Liotta’s postdocs, 
the team found a chemical reaction that ef- 
ficiently separated the enantiomers. The left- 
handed enantiomer, 3TC, was more potent, 
and they then synthesized a closely 
related compound, FTC, that in- 
cluded a fluorine atom. Subsequent 
studies showed that the drugs also 
had activity against hepatitis B virus. 
Schinazi had to lobby Emory to file 
patents for their compounds. “He's 
got natural entrepreneurial talents 
that most people have to teach them- 
selves,” Liotta says. “I think he can 
smell money.” 


IN 1993, LARDER LED a Wellcome 
team that found a remarkable test- 
tube effect of 3TC and FTC: HIVs 
that mutated and became resistant 
to these drugs became more sensitive 
to AZT. The synergy meant lower, far 
less toxic doses of AZT, plus it gave 
3TC and FTC a new reason for being. 
Novel combination therapies were in 
the offing. 

The next year, pharmaceutical giant Glaxo 
merged with Wellcome, leading Barry and 
a host of his colleagues to defect and form 
Triangle Pharmaceuticals. They joined forces 
with Schinazi, Liotta, and Chu, who had 
potential drugs on their university shelves. 
“[Barry] probably teamed up with Ray be- 
cause his enemy became Glaxo,” Larder says. 

The unusual synergistic features of 3TC and 
FTC—and their obvious market potentials 
—led to a tangle of lawsuits, with Emory su- 
ing Glaxo and BioChem for patent infringe- 
ment on 3TC, countersuits, and similar fights 
over FTC. The cases were all settled, trans- 
forming the fortunes of Schinazi and Liotta— 
and Emory. 

Triangle’s slow progress frustrated 
Schinazi, so he and Sommadossi formed 
two new drug companies in 1998 to focus 
on nucleoside analogs. “They weren’t taking 
any of our compounds, and I could see my 
technology sitting on a shelf,’ Schinazi says. 
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“T like speed.” Schinazi headed one, Pharma- 
sset, while Sommadossi ran the other, Idenix. 
“Both companies made some great scientific 
discoveries when nucleosides were basically 
considered poor science,’ Sommadossi says. 

Schinazi had a heart attack in 1998 while 
attending a meeting at Triangle. “Too much 
stress,” he says. “More than 30 years no va- 
cation, in the trenches, writing grants, try- 
ing to survive. I work too hard. You can ask 
my ex-wife.” Barry drove him to the hospital, 
where they discovered that Schinazi, then 48, 
had an arterial blockage known as a widow- 
maker. “You try to form two companies si- 
multaneously,’ he says. “It almost killed me.” 

Adding to his workload, Schinazi, 
Sommadossi, and a few other aggrieved re- 
searchers launched HIV DART that year 
because the main annual HIV/AIDS meet- 
ing in North America—the Conference on 
Retroviruses and Opportunistic Infections 
(CROI)—had repeatedly rejected many of 
their submissions. “We told the CROI people, 
‘Next time, if you don’t accept our abstracts, 
we're going to move our meeting to exactly 
the same date as CROJ; ” Schinazi says. HIV 
DART made its debut in December 1998, in 
St. Thomas, the U.S. Virgin Islands, while 
CROI took place in the harsh win- 
ter of Chicago. “I know how to play 
hardball,” Schinazi says. 

CROT has never invited Schinazi 
to give an honorary presentation 
or a plenary talk. He feigns in- 
difference, but he also says the 


made him super rich. Sofosbuvir, a nucleo- 
tide analog (it masks the negative charge), 
must be taken in combination with another 
drug, but unlike older HCV treatments, it has 
no serious side effects, works against virtu- 
ally all variations of the virus, and reduces 
the time to a cure from 48 to 12 weeks. Gilead 
bought Pharmasset for $11.4 billion to ob- 
tain sofosbuvir, which racked up more than 
$10 billion in sales last year, second only to 
one other drug on the market. But in the 
yin-yang of Schinazi’s life, this great success 
has brought him greater scrutiny than ever. 
First and foremost is sofosbuvir’s price 
tag of $1000 per pill. Gilead set the price, 
he notes, which he calls “obscene.” But at 
the same time, he blasts critics who sug- 
gest sofosbuvir’s price is not “reasonable” or 
“rational’—and that he profited too much. 
“Ts it fair to pay $3 for a bottle of water?” he 
asks. “Do you complain when you're thirsty? 
This is something that cures you from a dis- 
ease.” And he notes that Gilead has already 
dropped the price because it now combines 
the drug with a new HCV drug it makes and 
sells for $1125 per pill. The company also of- 
fers a steep discount for poor countries and 
allows generic sales; indeed, according to 


All the king’s pills 
Schinazi had a hand in the development of five FDA-approved 
drugs, three of which are used in popular combinations. 


Schinazi—“Look at the irony!’—he helped 
negotiate a price of about $11 per pill for his 
home country Egypt, which has the highest 
prevalence of HCV in the world. 

Schinazi’s precise role in sofosbuvir’s dis- 
covery is also contentious. He’s the co-author 
of a 2005 paper that discovered a precursor 
to the drug. But little of that potential drug 
reached the liver, where HCV resides. Pharma- 
sset then developed a “prodrug,” sofosbu- 
vir, that becomes active in the liver. “I give 
credit to Ray: He had the vision to start the 
company,’ says Michael Sofia, one of Pharma- 
sset’s key inventors of the prodrug. But when 
it comes to sofosbuvir’s discovery, he asserts, 
forget about Schinazi. “Go to the patents, go 
to publications—there are over 40 in that 
space,’ Sofia says. “His name’s not there.” 

Schinazi claims he now has a spectacu- 
lar HCV drug in the works. “We should be 
able to cure hepatitis C with just one drug,” 
he asserts. In November 2014, he merged 
his company RFS Pharma with Cocrystal 
Pharma, which he says also has strong leads 
on a hepatitis B cure and influenza drugs 
that work against all strains of the virus. 
He predicts Cocrystal will be as profitable 
as Pharmasset—and promises to give away 
his profits. “Everything I do now 
Tm giving to my charity,’ says 
Schinazi, who has bankrolled land 
for a new synagogue in Cabo San 
Lucas, an exchange program with 
the University of Bath, and en- 
dowed chairs at Emory. 
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gest Schinazi reel it in, but he has 


These drugs made Schinazi rich, 
but the hepatitis drug sofosbuvir 
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no interest. “You take it,” he says. “I 
only want the big ones.” & 
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saving California’s calves 


After a half-century of research, scientists find a 
promising solution to mysterious abortions in beef cattle 


eterinarian Tom Talbot of Owens 
Valley, California, remembers 
his ranching family’s first run-in 
with the livestock scourge known 
as foothill abortion. It was about 
1970, and his father had bought 
200 heifers from elsewhere in the 
state for breeding. He initially 
grazed them on leased pastures 
in the worst possible place for pregnant cat- 
tle: the foothill country of California. About 
a month before the new calves were due, 
Talbot’s father started finding aborted fe- 
tuses scattered across his valley ranch. 
More cropped up each day. “One hundred 
percent of those animals aborted,” recalls 
Talbot, who is a past president of the Cali- 
fornia Cattlemen’s Association. “It was the 
absolute most God-awful wreck you can 
have imagined.” 

The calves—and his father’s business— 
were victims of a devastating but highly re- 
gional cattle disease that causes pregnant 
heifers to lose their fetuses in the third tri- 
mester. Foothill abortion has not abated in 
the half-century since Talbot witnessed it. 
Even now it kills an estimated 5% to 10% of 
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By Ingfei Chen 


California’s annual beef calf “crop’—a loss 
of 45,000 to 90,000 animals, worth many 
millions of dollars. But the veterinary drug 
industry has ignored it, concentrating on 
bigger markets. And researchers have long 
struggled to find a cause or a cure. 

“Foothill abortion’s been an enigma. 
Everything about it has made it difficult 
to work with,” says University of California 
(UC), Davis, microbiologist Jeffrey Stott. 
But after years of sleuthing, he and his col- 
leagues identified the odd bacterium re- 
sponsible, which belongs to a class of soil 
microbes better known for their swarming, 
gliding behavior than for causing disease. 
And by stitching together funding and 
collaborations, the scientists developed a 
promising vaccine and are now testing it in 
thousands of heifers. 


ON A HOT, CLOUDLESS March afternoon 
in Buellton, California, a young Black Angus 
heifer bellows like a foghorn from inside a 
holding cage. Stott, in a baseball cap and 
faded red, long-sleeve shirt, has just injected 
the vaccine into the cow’s neck. A local vet- 
erinarian draws a blood sample from its tail, 
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and two cowboys affix bright yellow num- 
bered tags to the animal’s ears. In less than a 
minute, the cage clangs open and the protest- 
ing cow bolts away in a cloud of dust, rejoin- 
ing the nearby herd. 

Two hundred and fifty-five cows down, 
265 to go. “It moves pretty quick; it’s a non- 
stop process,” Stott yells with a grin over the 
rumble of a generator that powers the hy- 
draulically controlled cage doors. The crew 
was in a dawn-to-dusk push to administer 
the injections to more than 500 cattle pas- 
tured on the coastal foothills here, 70 kilo- 
meters northwest of Santa Barbara. 

On the flanks of the Sierras and the coastal 
mountain ranges in California, southern Or- 
egon, and Nevada, foothill abortion is as 
much a part of the landscape as the gnarled 
oaks. Stott’s home institution, UC Davis’s vet- 
erinary school, was founded in 1948 in part 
to study the disease, which was later named 
epizootic bovine abortion (EBA). In an early 
effort to find out what might spread it, preg- 
nant cows were hefted onto “treehouse” plat- 
forms, away from anything that crawls on the 
ground but still within reach of mosquitoes. 
The cows all produced healthy calves. 
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Heifers in California's foothills country are at risk 
of an infection that threatens their calves. 


By the 1970s, researchers at UC Davis 
and the UC Cooperative Extension had 
fingered Ornithodoros coriaceus, a brown, 
soft-bodied tick endemic to foothill regions, 
as the disease carrier. “The ticks love lying 
under the oaks,” Stott says—and cattle bed 
down there as well. Also known as the pa- 
jaroello (pa-ha-WAY-lo) tick, it is infamous 
in Native American lore for inflicting a more 
agonizing bite than a rattlesnake, he says. 

Tick-bitten cows don’t get sick from the 
subsequent infection, but if they’re pregnant 
and weren't exposed to the tick before, they 
often lose their babies. Abortion “storms” can 
wipe out 50% to 90% of new calves in a herd 
that lacks immunity—as ranchers such as the 
Talbots learned the hard way. 

By the time Stott joined the UC Davis 
faculty in 1985, researchers there were im- 
mersed in the hunt for a tick-borne pathogen 
in foothill abortion. Each decade has had its 
fashionable suspect: a chlamydia bacterium 
in the ’70s, a novel Borrelia bacterium (from 
the same family that causes Lyme disease) in 
the ’80s, and retroviruses in the early ’90s. 
They were all red herrings. 

Progress was tough. Conducting studies in 
pregnant cows, with their 9-month gestation 
periods, was slow, expensive work. Moreover, 
deliberately infecting animals for research by 
exposing them to hordes of ticks works only 
half the time. And because foothill abortion 
has a geographically limited impact, funding 
was scarce and major pharmaceutical com- 
panies had no interest, Stott says. 

But he and his colleagues Myra Blanchard 
and Mark Anderson began making head- 
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way after joining forces in 1992 with Mark 
Hall and Donald Hanks at the University of 
Nevada, Reno (UNR), which had a research 
cattle herd living in tick-free meadows. The 
team found they could reliably transmit 
EBA to the beasts by injecting them with 
ground-up thymus tissue from aborted bo- 
vine fetuses. 

In a pivotal experiment published in 2002, 
the scientists showed that treating pregnant 
heifers with heavy doses of peni- 
cillin and tetra-cycline at the time 
of the injections prevented the 
abortions. “That told us it was a 
bacterial pathogen,” Stott recalls. 

The perpetrator remained elu- 
sive. “You couldn’t culture it and 
you couldn’t see it” under a mi- 
croscope, Blanchard says. But 
with the help of new molecular 
biology methods, the researchers managed 
to extract some of the mystery microbe’s 
DNA from infected thymus tissue and iden- 
tify its bacterial family. In 2005, they pin- 
pointed it as a novel species of Myxococcus 
“slime bacteria,’ the first pathogen known 
within that group. The discovery was a tri- 
umph of “persistency, systematic approach, 
and scientific acumen,” says entomologist 
Robert Lane, a professor emeritus at UC 
Berkeley, who once proposed a Borrelia bug 
as the cause. 

“Tt’s a craaazy unusual bacteria,’ Stott says 
of foothill abortion, and not just because 
slime bacteria tend to swarm and glide. He 
and his colleagues call the pathogen Paja- 
roellobacter abortibovis, and it appears to 
foment disease in a unique way. Their un- 
published work suggests it lives in the im- 
mune system cells called macrophages, 
where it replicates slowly, without triggering 
symptoms in an infected cow. “It’s only if she 
is pregnant that you see disease,” Blanchard 
says. What kills the bovine fetus is its own 
immune response to the infection: When the 
animal’s immune system finally matures in 
the third trimester, it unleashes an attack 
that causes extensive damage to its tissue 
and organs, triggering abortion. 


Veterinarian Jeffrey Stott spent decades working 
toward the vaccine he is now testing. 
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The pajaroello tick 
carries the pathogen. 


GETTING A VACCINE for foothill abortion 
to affected beef producers has become a 
personal mission for Stott, who grew up on 
cattle ranches in rural Oregon. He and his 
fellow scientists cobbled together funding for 
the project, receiving $212,000 from the Cali- 
fornia Cattlemen’s Association and another 
$205,000 from a UC grant program. 

Field trials began in 2011 after pilot 
studies at UNR saw promising results from 
the experimental vaccine, which 
consists of live P. abortibovis pro- 
duced by infecting immunodefi- 
cient mice and extracting their 
spleen cells. Beef producers are 
a wary lot, but Talbot, the veteri- 
narian, helped recruit ranchers 
to participate, and the scientists 
have been testing the vaccine in 
4370 cows at about 10 sites, in- 
cluding Buellton, in the past 4 years. 

“We became very confident, very quickly, 
that this vaccine was spectacular,” Stott 
says. Based on their unpublished data, he 
says, it looks close to 100% effective, keep- 
ing cows free of infection and thus pro- 
tecting their fetuses for at least 18 months. 
However, ranchers must wait 60 days after 
inoculating heifers before breeding them, 
because of a risk that still-lingering bac- 
teria from the vaccine could trigger early 
fetal deaths, he says. 

UC Davis is now in talks with a small 
vaccine manufacturer that is interested 
in bringing the preventive to market. 
Stott and Blanchard plan to make the ex- 
perimental vaccine available to as many as 
6000 more heifers through continuing field 
trials at 50 other ranches over the next year. 

Fifth-generation cattleman Buck Parks, 
41, whose family ranch in Modoc County, 
California, has lost 20 to 35 calves to foot- 
hill abortion every year for decades, has no 
doubt that the vaccine is helping. Parks says 
he seriously considered selling his business 
at one point, but this spring, with almost 
half his breeding cows vaccinated in field 
trials over the past 3 years, only eight calves 
aborted. “I don’t want to compare it to cur- 
ing cancer, but it’s pretty close,” Parks says. 

Stott’s quest isn’t over, however, be- 
cause many basic mysteries about foothill 
abortion remain unsolved. Where is the 
pajaroello tick picking up the pathogen? 
Could mule deer, which are also hosts for 
the ticks but don’t seem to get the abortion 
syndrome, be slowly spreading the disease 
geographically? Once the calves of the 
California foothills are safe, Stott is eager 
to explore such questions. “That’s what I 
want to get back to doing,” he says. ™ 


Ingfei Chen is a freelance writer in 
northern California. 
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THE ENVIRONMENTAL FILM FESTIVAL IN THE NATION’S CAPITAL 


Brave new world 


Each March, the museums, embassies, libraries, and local theaters of Washington, D.C., play host to a festival of films that shed 
light on pressing environmental issues. Many of the screenings are free, and all are open to the public. A number of this year’s 
films sought to draw specific attention to climate change, a move organizers hoped would help the festival “serve as a signpost” 
on the route to the 2015 United Nations Climate Conference, which will be held later this year in Paris, France. Other films 
challenged audiences to confront the role of humans in wildlife extinction, invited viewers to think twice about our seemingly 
insatiable appetites for food and fuel, and encouraged the adoption of more sustainable ways of life. Read on to see what our 
reviewers thought of nine of this year’s films. 


The E-Waste Tragedy 


Reviewed by Marc Lavine 


Desktops, laptops, e-book readers, netbooks, 
tablets, cell phones, and flat panel, HD, and 
4K displays—every year, new gadgets call out 
to us, promising to enhance our lives. But 
what happens to the old and broken devices 
we discard, including the large appliances 
that contain and depend on electronics? 
Globally, dealing with garbage and waste is a 
problem, but e-waste adds its own particular 
challenges due to the toxicity and scarcity of 
some of the raw materials. Despite our best 
intentions, when we donate an old piece of 
equipment to a charity or dispose of it at an 
authorized recycling center, it often ends up 
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at dump sites in Africa or China. At best, 
it may just pile up and slowly decompose, 
but more likely it will be processed under 
hazardous and uncontrolled conditions to 
recapture the base metals or disassembled 
so that the old components can be fraudu- 
lently resold as new ones. In The E-Waste 
Tragedy, Cosima Dannoritzer takes us on 
a journey—one that crisscrosses countries 
and continents—as e-waste is removed from 
the legal and controlled recycling streams 
and finds its way to improper recycling and 
disposal sites. Despite improved legislation 
in some countries, including laws that incor- 
porate recycling fees into the purchase of 
new electronics and ban the exportation of 
e-waste, too many individuals and compa- 
nies find ways to profit at the expense of the 
health of the people and environment where 
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The E-Waste Tragedy 
Cosima Dannoritzer, 
director. 

Media 3.14, Spain, 2014. 
86 minutes. 


the waste ends up. Dannoritzer hopes that 
the behavior of consumers will change in 
order to reduce the quantity of waste being 
produced, but it seems more likely that the 
problem will only be solved when the costs 
associated with the human health risks, the 
increased scarcity of some raw materials, 
or the catastrophic failure of falsely labeled 
recycled parts drives the business and politi- 
cal communities to find better solutions to 
this growing problem. 
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Tiger Tiger 

George Butler, director. 
White Mountain Films, United 
States, 2015. 90 minutes. 


Tiger Tiger 


“Unless we do something, tigers are not 
going to survive.“ So says Alan Rabinowitz 
in one of the first lines of this documentary, 
which focuses on collecting data on the tiger 
population in protected areas of the Sundar- 
bans mangrove forest of India and Bangla- 
desh. This region may be one of the largest 
remaining contiguous populations of tigers 
and thus one of the best places to focus 
conservation efforts. Highlighting Rabinow- 
itz’s research trip, this film offers footage not 
only of tigers and their likely prey, captured 
through camera traps, but also of their great- 
est threat—the encroaching human popula- 
tion. On the basis of his interactions with the 
locals, it is clear that the people of these areas 
understand the potential long-term benefits 
of the tiger preserves. However, it is also 
apparent that there is a lack of governmental 
policy and protection, especially in Bangla- 
desh, as well as a large financial incentive 
driving people to kill the tigers. The mortal- 
ity of both human and feline are entwined 
throughout the film as we learn that Rabi- 
nowitz has leukemia—a fact that will likely 
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make this his final field expedition. Coupled 
with the uncertainty of efforts to save the 
tiger, a sense of melancholy permeates the 
film. But Rabinowitz’s push for tiger conser- 
vation lends hope for the survival of the 
species, and the movie intends for his exam- 
ple to propel others to take up the challenge. 


Just Eat It: A Food 
Waste Story 


Reviewed by Jennifer Sills 


Would it be possible to survive for 6 months 
on only discarded food? In Just Eat It, Grant 
Baldwin and Jen Rustemeyer set out to dojust 
that, scouring the dumpsters of Vancouver, 


Grant Baldwin 
considers a 
dumpster filled 
with discarded 
hummus. 
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Just Eat It: 

A Food Waste Story 

Grant Baldwin, director. 
Peg Leg Films, Canada, 2014. 
75 minutes. 


British Columbia, where they discover a veri- 
table cornucopia of wasted produce, dairy, 
prepared meals, and more. Interspersed with 
their experiences, the film provides vignettes 
about food waste at various junctures in the 
production system. We visit farms and facto- 
ries where between 20 and 70% of bananas, 
celery, and nectarines are thrown away 
immediately after harvest, simply because 
they don’t meet the stringent cosmetic stan- 
dards set by supermarkets. At a scrap farm 
outside Las Vegas, 2500 swine are fed on 
just 8% of the food waste generated by the 
city’s restaurants and hotels. We learn that a 
third of the food produced globally is wasted, 
including 15 to 25% of the food we purchase 
in our regular trips to the supermarket. The 
loss of the food itself is compounded by the 
waste of all the resources that contributed to 
its production. 

Why is the system so flawed? Some of the 
waste can be traced to economics. Farms can’t 
afford to harvest and ship food for which 
there is no demand. Food banks lack the 
infrastructure to accept donations in large 
quantities. However, the film maintains that 
culture is equally important. Unlike littering, 
wasting food is condoned in rich countries. 
A host with no leftover food is considered 
a failure. Just Eat It offers unsettling statis- 
tics—for example, the water required to 
produce one hamburger is the equivalent 
of a 90-minute shower—along with striking 
visuals—a field full of wasted celery stalks, a 
dumpster filled to the brim with containers 
of not-yet-expired hummus. After watching, 
you might just rethink going out to eat while 
those leftovers sit in the fridge another day. 
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Seeds of Time 

Sandy McLeod, director. 
Hungry, United States, 2014. 
77 minutes. 


Seeds of Time 


Reviewed by Valerie Thompson 


Buried beneath the permafrost on a remote 
Norwegian archipelago is a vault that holds 
the seeds of more than 4000 plant spe- 
cies. The Svalbard Global Seed Vault, as it 
is known, has a passionate advocate in the 
American agriculturalist Cary Fowler, who 
provides much of the narration for this 
compelling film. Through Fowler, we learn 
that vegetable crops in the United States 
now have less than 7% of the diversity that 
existed just a century ago—troubling news 
for a world in which staple food crops are in- 
creasingly vulnerable to disease and climate 
change. Maintaining crop diversity via seed 
banking, he argues, is essential to safeguard- 
ing our agricultural future. 

Woven throughout Fowler’s quest to pro- 
mote seed-banking efforts around the globe 
is the story of a group of indigenous Peru- 
vian farmers who are working to preserve 
native potato crops that are threatened by 
rising global temperatures. Their struggle 
to save this important food staple serves as 
a harrowing reminder of just how high the 
stakes truly are. 

In a fascinating flashback to the mid-20th 
century, we learn that the city of Leningrad 
once boasted the world’s largest seed bank. 
So dedicated was the group of scientists 
who tended the collection that they refused 
to eat any of its contents during the period 
of intense starvation that accompanied the 
protracted World War II military blockade 
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known as the Siege of Leningrad. By the end 
of the blockade, nine of the curators had 
died of starvation. This vivid example places 
today’s efforts in a broad historical context. 
The film ends by discussing the value of 
collecting and preserving wild relatives of 
crop species, which, through breeding and 
genetic engineering, may be able to confer 
drought and disease resistance to food crops. 
Ultimately, we will likely need to embrace 
both traditional and modern techniques for 
preserving plant biodiversity if we hope to 
maintain a secure global food system. 


Bikes vs Cars 


Reviewed by Valerie Thompson 


More than 20 million people live in Brazil’s 
Sao Paulo metro area—a fact that is not eas- 
ily forgotten during the hours of frustrating 
traffic congestion that define many resi- 
dents’ daily commute. As the title suggests, 
this thought-provoking film pits car-centric 
commuting against alternative modes of 
transportation, including bikes and, to a 
lesser degree, public transit. The film fea- 
tures a wide range of perspectives—an ur- 
ban planner in Sao Paulo, a bike advocate in 
Los Angeles, a city councilman in Toronto, 
a taxi driver in Copenhagen—most of whom 
embrace, or at least pay lip service to, the 
notion of improving bike-friendly infrastruc- 
ture. Some of the film’s most persuasive ar- 
guments focus on the dangers that cars pose 
to bystanders and cyclists. We learn, for ex- 
ample, that in Toronto, a pedestrian is hit by 
a car every 3 hours. In another scene, we wit- 


Bikes vs Cars 

Fredrik Gertten, director. 
WG Film, Sweden, 2015. 
88 minutes. 
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Peruvian farmers are working to preserve over 1500 
potato varieties that are native to the Andes Mountains. 


ness a large crowd gathered to pay tribute 
to a fallen bike rider in SAo Paulo, where an 
average of one cyclist is killed by a motor- 
ist every week. Nor does the film neglect the 
environmental arguments in favor of bike- 
based commuting. This point is driven home 
with a scene describing a weekend construc- 
tion project that shut down a 10-mile stretch 
of one of Los Angeles’s busiest highways in 
2011. Taking advantage of this natural exper- 
iment, UCLA scientists showed that air qual- 
ity near the highway improved by 83 percent 
that day, relative to comparable weekends. 
Broad in scope, and compellingly presented, 
this film opens an important dialogue about 
one possible solution to the commuting co- 
nundrum facing growing urban areas today. 


Cowspiracy: 

The Sustainability Secret 
Kip Andersen and 
Keegan Kuhn, directors. 
A.U.M. Films and First Spark 
Media, United States, 2014. 
91 minutes. 


Cowspiracy: The 
Sustainability Secret 


Reviewed by Sacha Vignieri 


If you were asked to identify the greatest hu- 
man impact on the planet, what would you 
answer? Perhaps our oil and gas consump- 
tion and the resulting release of climate- 
altering greenhouse gases? As filmmakers 
Kip Andersen and Keegan Kuhn success- 
fully argue in this film, the answer is actu- 
ally livestock production, which is not only 
the leading source of greenhouse gases but 
also the largest consumer of water and other 
natural resources and the leading driver of 
habitat destruction. Through a series of in- 
terviews with scientists, authors, and activ- 
ists, the film demonstrates how our current 
production of livestock for meat, eggs, and 
dairy is both a major driver of many of the 
environmental issues we face today and en- 
tirely unsustainable. Yet, there appears to be 
large-scale resistance to discussing its dam- 
aging impact on the environment and little 
thought given to the idea that we may need 
to change our behavior when it comes to the 
foods we eat. Andersen and Kuhn explore 
what might be underlying this resistance, 
including industry and political pressure, 
as well as our own denial. Unlike activities 
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like oil and gas production, diet choices are 
profoundly personal, and the film leaves one 
feeling as though we as individuals have 
great power to foment bottom-up changes 
with simple actions. In the end, the film is 
a noble and entertaining discussion of how 
we can begin to tackle one of the biggest, 
yet least addressed, human-caused environ- 
mental problems. 


Return of the River 

John Gussman and Jessica 
Plumb, directors. 

Elwhafilm LLC, United States, 
2014. 70 minutes. 


Return of the River 


Reviewed by Julia Fahrenkamp-Uppenbrink 


In 1914, the Elwha Dam on the Olympic 
Peninsula in Washington state began to 
generate power, bringing energy to the town 
of Port Angeles. Upriver, Glines Canyon 
Dam followed in 1927. Told in a poetic style 
and beautifully shot, this film shows how 
the dams, seen by many as an essential 
element of “conquering” the wild country- 
side, changed the river and the surround- 
ing ecosystem. Clams disappeared as fine 
sediment no longer washed down to the 
river’s mouth. Salmon and other migrating 
fish populations collapsed as the spawning 
grounds that were still accessible deterio- 
rated. Salmon hatcheries set up to mitigate 
falling fish numbers were a poor replace- 
ment for natural salmon runs. 

In the end, community engagement was 
key to the momentous decision to remove 
both dams. The film tells the story of the 
environmentalists, politicians, and members 
of the Lower Elwha Klallam Tribe who 
fought for decades for the removal of the 
dams and the restoration of the river. As the 
Elwha Dam turbines stopped on 1 June 2011, 
the camera zooms in on the last, handwrit- 
ten entry in the dam’s record book: “Elwha 
PH. Shut Down—Final.” Then the dams 
come crashing down. 

Replanting the former reservoirs will take 
several years; ecological changes will unfold 
over decades. But as salmon are traveling 
up the Elwha River again to spawn, Senator 
Bill Bradley captures the importance of this 
dam removal—the largest in history—when 
he says hope is “the great gift of the Elwha.” 

For more information on the ecological 
effects of dam removals, see J. E. O'Connor, 
J. J. Duda, G. E. Grant, Science 348, 496 
(2015). 
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Who Owns Water 


Reviewed by Nick Wigginton 


Tracking water as it flows downstream 
is usually a straightforward task. But as 
a river flows through communities and 
across political boundaries, who should be 
able to divert, pump, drink, or fish in that 
water? Does it change on one side of the 
riverbank versus the other? These are diffi- 
cult questions to answer, but as access to 
fresh water diminishes in some regions of 
the world, they will become more common. 

In Who Owns Water, filmmaker David 
Hanson and his brother Michael paddle 
down the Chattahoochee River in the 
southeastern United States, from its spring- 
fed origin through cities and national parks 
to the Gulf of Mexico. The often breathtak- 
ing views from the river change constantly, 
from canyons to massive reservoirs to 
flooded forests. At times, the water is rough 
enough to tip the most expert paddler. Yet 
at other times, the water levels are so shal- 
low, they are barely navigable by canoe 
at high water. The river also serves as the 
primary battleground between communi- 
ties, landowners, and three U.S. states, who 
are fighting over the dwindling freshwater 
reserves in this growing region. 

Along their journey, the brothers inter- 
view several residents spanning a range 
of interests, from environmentalists to 
oyster farmers, all of whom claim a reason- 
able stake in the river. In isolation, each 
of their stories about the changing river 
seems to require simple solutions and 
often justifiably direct blame upstream or 
shift consequences downstream. As the film 
highlights, however, the river is more than 


Who Owns Water 

David Hanson, Michael 
Hanson, Andrew Kornylak, 
directors. 

MoDoc and Fourmile Media, United 
States, 2014. 50 minutes. 


Kendall Shoelles harvests oysters in Apalachicola Bay. 
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a shared resource connecting communi- 
ties in one watershed in one small corner 
of the world. Answering the question of 
who owns water remains difficult, but it is 
a question that we still need to ask. 


Gambling on Extinction 4 & 
Jakob Kneser, director. { 
Real to Reel Productions Inc., 

Germany, 2014. 

52 minutes. 
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Gambling on 
Extinction 


Reviewed by Barbara Jasny 


The film Gambling on Extinction describes 
what many see as a war on the rapidly 
declining populations of rhinos and 
elephants in Africa. Certainly, the facts 
presented are horrifying: If poaching 
continues at its current rate, there could be 
no more wild elephants within two decades. 
However, what makes the film memorable 
is not so much that it presents new data but 
that it reveals the dimensions of the prob- 
lem. Investigations of fly-covered carcasses 
reveal that parts have been carved from 
still-living animals and that the equip- 
ment used indicts outsiders who cross 
borders to decimate native populations. To 
the poachers, the lives of the animals are 
irrelevant; they just see them as the “white 
man’s toys.” Through investigative journal- 
ists and undercover agents, we meet rhino 
horn users (who believe the myths that this 
material, made from the same material as 
fingernails, has healing properties) and an 
ivory carver. We see sellers who have cata- 
logs of illegal animal parts for sale and who 
disguise the horns and tusks as wood to 
escape customs officials. 

Much of the ivory loads from African 
ports are sent to China, where carved ivory 
is considered to be an investment whose 
value can only go up as more and more 
elephants are killed. Part of the problem in 
China is that there is a legal supply, based on 
government stockpiles, which acts as a cover 
for the illegal trade. However, in a recent 
survey, 60% of the Chinese respondents said 
that they would not have the products if 
the government instituted a universal ban. 
Ten countries have destroyed their ivory 
stockpiles, demonstrating a growing global 
commitment to conservation efforts. As 
Grace Gabriel from the International Fund 
for Animal Welfare declares, “There are 
things money shouldn’t buy.” 
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Understanding friction in layered materials 


A scanning probe technique provides a clearer picture of friction at the nanometer scale 


By Kenneth M. Liechti 


s we move to the digital age, we may 

forget that a lot of correspondence 

was accomplished by pencil and pa- 

per. Nonetheless, when we do still 

write with “lead” pencils, we are 

making use of the weak interlayer 
forces in graphite to strip off tiny flakes of 
graphite and deposit them on paper in a 
complex interplay between adhesion and 
friction. Understanding the nature of such 
weak interlayer forces and the attendant 
opportunities afforded by such understand- 
ing are becoming more important with the 
advent of nano- and microelec- 
tromechanical systems (NEMS/ 
MEMS); machines whose moving 
components are at the nanometer 
and micrometer scales embodied 
in their names. Because the sur- 
face area of such components is 
much larger than their volume, 
surface forces (for example, van 
der Waals, electrostatic, and cap- 
illary forces) become the domi- 
nant loading to be considered 
in the design and reliability of 
NEMS and MEMS devices (J). 
On page 679 of this issue, Koren 
et al. (2) have devised an inno- 
vative experiment that quanti- 
fies our day-to-day experience 
with lead pencils, but with much 
higher fidelity than we might 
expect. They show that two ad- 
jacent planes of carbon atoms 
can be sheared in highly ordered 
pyrolytic graphite (HOPG) and 
observe the interplay between ad- 
hesion and friction forces in this 
layered material. This experiment 
with HOPG can be considered as a 
model for two planes of graphene 
or other so-called two-dimen- 
sional materials sliding over one 
another. The findings are exciting 
because they point to new ideas 
for data storage at small scales, as 
well as novel actuation concepts 
in electromechanical devices. 


Center for the Mechanics of Solids, Structures 
and Materials, Department of Aerospace 
Engineering and Engineering Mechanics, 
University of Texas at Austin, Austin TX 78712, 
USA. E-mail: kml@mail.utexas.edu 
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The challenge of measuring relatively 
weak forces between components has been 
addressed by a number of instrumented 
indentation and lateral loading devices, 
which all make use of probes that are 
brought into contact or near contact with 
the target surface while the normal and lat- 
eral (friction) load on and displacement of 
the probe are measured. With sharp enough 
probes, it is possible to observe oscillations 
in the lateral force-displacement response 
whose maxima and minima, respectively, 
coincide with the onset of slipping and 
attachment of the probe to the target sur- 
face and were commensurate with the in- 


Taking a measure of friction 
Illustrating the method of Koren et al. with a toy model of a layered 
system under lateral shear and torque. 


Lateral shear 
3 
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Lateral shear 
ee 


teratomic spacing of the mica surface that 
was being probed (3). The novelty of the 
experiment by Koren e¢ al. lies in the use of 
an atomic force microscope to apply a lat- 
eral shear force to disks (mesas) of HOPG. 
The disks, 50 nm thick and with radii that 
ranged from 50 to 300 nm, were sheared 
along the basal plane of the graphite, about 
10 nm from their bases. A classic frictional 
lateral load-displacement response was 
observed, which was largely reversible ex- 
cept for dynamic dissipative events that oc- 
curred during sliding. They can be seen as 
slight differences in the force-displacement 
response during sliding in opposite direc- 
tions. These latter events were 
related to slight rotational mis- 
alignments between the graph- 
ite lattices above and below the 
slip plane. In addition, in a sec- 
ond set of experiments, when a 
torque was applied to the me- 
sas, stable orientations were ob- 
served every 60%, in registration 
with the sixfold symmetry of the 
hexagonal carbon atom lattice. A 
third set of experiments, with a 
structure exhibiting mechanical 
bistability, demonstrated the po- 
tential to produce cell structures 
for memory devices that are en- 
tirely mechanical in operation. 
The various equilibrium states 
of these tiny cells could then be 
used as switches in such memory 
devices. 

The idea behind the method 
developed by Koren et al. can 
be illustrated more easily if we 
consider, for example, a toy sys- 
tem consisting of ping-pong balls 
glued together in a hexagonal 
pattern (close packed) in two 
planes that are placed on top of 
one another (see the figure). The 
in-plane attachment is much 
stronger than the interaction be- 
tween the planes, just as in lay- 
ered, two-dimensional materials. 
One can imagine sliding or ro- 
tating the planes relative to one 
another and finding a series of 
equilibrium positions. 

Another opportunity afforded 
by the experiment of Koren et 
al. is that it may present a close 
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link to multiscale modeling from atomistic 
to continuum scales in that the interface or 
slip plane was free of any contamination 
and the geometry was relatively simple, 
which is not usually the case for contact 
pairs. Although atomistic or molecular 
analyses have provided insights into defor- 
mation and dissipation mechanisms and 
other effects that are difficult to separate 
out experimentally, one-to-one comparisons 
are limited by the high shear velocities and 
nanometer-scale probe tips that are cur- 
rently accessible to analysis. Some prog- 
ress in this direction has been provided by 
quartz microbalance experiments (4) and 
molecular analyses (5) of krypton monolay- 
ers sliding on gold, where the close agree- 
ment between simulation and experiment 
identified mechanisms for phononic con- 
tributions to friction. Molecular self-assem- 
bled monolayers provide another, slightly 
more complex avenue for linking experi- 
ments and analysis in addition to having 
technological relevance to MEMS. The is- 
sue of time and spatial scaling in atomistic 
analyses of friction between self-assembled 
monolayers has been identified as one of 
the current challenges in single asperity 
friction (6). One possible way of addressing 
this issue is through hybrid analyses with 
schemes that combine molecular analyses 
with phenomenological potentials (7, 8) 
that have allowed typical sliding speeds to 
be accessed. 

Koren et al. have come up with a novel 
experiment that provides a better under- 
standing of friction in layered materials. 
Although graphite was the subject of this 
study, the extension to the plethora of two- 
dimensional materials that are becoming 
available is clear. The authors have also 
provided much food for thought in develop- 
ing potential applications of the observed 
phenomena to novel memory devices and 
machines. & 
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INFECTIOUS DISEASE 


More than Just bugs in spit 


Advances in imaging help to explain tuberculosis treatment 
failures and develop better drugs 


By Clifton Barry’? 


he development of new drugs for tu- 

berculosis (TB) is not for the faint of 

heart. In addition to all the typical 

hurdles to developing a new anti- 

infective, additional complications 

result from the complex pathology 
induced by the disease. Because the bacil- 
lus is surrounded by a thick, waxy coat, 
hit rates in whole-cell screens are typically 
low; the resulting hits are typically so lipo- 
philic that they do not offer good chemical 
starting points. Despite extensive research 
efforts, no TB drug has been developed us- 
ing target-based approaches. Advances in 
imaging at both a macroscopic and micro- 
scopic level offer new insights and strate- 
gies for getting urgently needed new drugs 
to the clinic. 

TB is a disease of the lungs: It is trans- 
mitted by inhalation of an aerosol droplet 
containing the bacterium, which implants 
in small alveoli and then invades and mul- 
tiplies in the interstitial spaces in the lung. 
Here, it forms granulomas—that is, nodules 
of immune cells recruited as the body at- 
tempts to wall off the invading bacteria. 
Eventually, the granulomas undergo necro- 
sis to form cavities and leak out into the 
airways, spilling bacteria into the lungs and 
inducing coughing to enable transmission 
to a new host. A single patient with TB can 
have dozens of individual lesions, all at dif- 
ferent stages of maturation. 

TB therapy is often very protracted. 
Standard medical care for pulmonary TB 
patients globally involves 6 months of treat- 
ment. Although full treatment is successful 
in 95% of patients, in reality many patients 
fail to complete such a long course of ther- 
apy; mobilizing the resources to ensure 
full compliance is nearly impossible in re- 
source-poor countries where the disease is 
endemic. Shorter courses of treatment are 
one of the major goals of global research 
efforts because they would al- 
low for more supervision, less 
recurrent disease, and corre- 
spondingly less opportunity for 
developing drug resistance. 

U.S. and European regulatory 
bodies mandate that end points 
in phase 3 clinical trials of new 
TB drugs focus on achieving dis- 
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ease-free status in patients 1 year after com- 
pletion of treatment. Because determining 
relapse-free cure requires such a long time, 
end points in phase 2 clinical trials focus 
on killing bacteria in expectorated sputum 
over the first few months of treatment. It 
has long been clear that the end points in 
these two different stages of clinical evalua- 
tion are not well correlated, as emphasized 
again recently by the failure of three large 
phase 3 studies that aimed to shorten the 
standard 6-month regimen to four months 
by including a fluoroquinolone as part of a 
four-drug regimen (7-3). Despite achieving 


“All these exciting 
techniques have tremendous 
potential to accelerate 

TB drug development by 
improving the quality 

of drugs and drug 
combinations used in 
clinical trials to target 
specific lesion types.” 


earlier sterilization of sputum than conven- 
tional therapy in preceding phase 2 studies 
and in the phase 3 trials themselves, more 
patients in the 4-month fluoroquinolone 
arms suffered relapse after treatment was 
discontinued. The fact that these patients 
were obviously not cured shows that bac- 
teria in the sputum do not tell a complete 
story. Nor would one expect them to do so, 
given that only certain lesions produce bac- 
teria that are represented in the airways. 
Curing all patients with a shortened TB 
treatment regimen requires developing an 
understanding of what leads to a relapse— 
defined as recurrent disease 
with the same bacterial strain 
after apparent initial cure—in 
a subset of treated patients. 
There are two main hypotheses 
for why relapse occurs. One 
argues that relapse is due to a 
subset of bacteria with unique 
physiology (imposed by drugs, 
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Density associated with TB lesions 


Air in a cavitary lesion 


2D axial slice of the CT scan 


Abnormal CT density in the lungs of a patient with pulmonary TB. A three-dimensional map generated from a CT scan shows 
abnormal density in TB lesions at millimeter resolution. These computationally segmented lung regions can be precisely quantified 
and monitored over time, allowing a real-time quantitative evaluation of response to a given drug regimen. Produced in Amira 5.6.0 


(FEI Visualization Sciences Group). 


the immune system, physical properties of 
the local environment, or subpopulations 
that arise stochastically) (4). The other 
maintains that relapse is due to bacteria 
in an area of unique pathology where their 
exposure to antibiotics is limited (5). Ex- 
perimental data exist in support of both 
theories, and it is likely that each has some 
validity. Recently, translational studies have 
started to move beyond simple measure- 
ments of bacteria in sputum to elucidate 
the metabolism and status of bacteria that 
are buried in lesions that are not shedding 
bacteria into the sputum. 

Advances in radiologic imaging and com- 
putational image analysis enable precise 
quantitation of all types of lesion pathol- 
ogy present in the lungs of TB patients. 
Computed tomography (CT) can generate 
three-dimensional (3D) maps of every voxel 
of the lung at millimeter resolution (see 
the figure). In a recent study, changes in 
abnormal CT lung density after 2 months 
of treatment were shown to accurately pre- 
dict how patients would fare 2 years later, 
after they had completed treatment (6). 
Positron-emission tomography (PET) scan- 
ning involves administering a radioactive 
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probe and collecting information about the 
location of positron emission and hence lo- 
cation of the probe. TB lesions (and likely 
lesions from many pathogens) are sites of 
intense inflammation; quantification of 
changes in the extent of inflammation dur- 
ing the first 2 months of treatment pre- 
dicted ultimate treatment outcome just as 
well as CT changes (6). 

Both approaches have also been applied 
to nonhuman primates, offering a smooth 
pathway between a realistic animal model 
and clinical trials (7). The initial studies in 
humans were done 2 months after treat- 
ment started, but results in rabbits suggest 
that radiographic changes as a result of 
chemotherapy are quantifiable as early as 
1 week after initiation. Furthermore, PET 
probes that report on processes specific to 
TB are under development, potentially pro- 
viding a quantitative means of assessing 
bacterial numbers in specific lesions (8, 9). 

The search for markers to determine 
when treatment is complete has also been 
advanced by looking at changes in circulat- 
ing levels of immunologically active mol- 
ecules in blood from patients on treatment 
(10), as well as by analyzing changes in me- 
tabolites in urine during the first month of 
treatment (77). Both approaches offer poten- 
tial surrogate markers that might be used to 
rapidly cycle through new drug compounds 
and combinations in order to identify the 
most promising drug candidates. Larger 
cohorts of patient samples, preferably from 
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trials designed to attempt treat- 
ment shortening, are urgently 
needed for all these techniques 
to assess the predictive validity of 
such markers. 

Imaging mass_ spectrometry 
(IMS) is another technique that 
advances the understanding of 
persisting bacterial populations. 
IMS creates 2D ion maps from ex- 
cised tissue samples, allowing the 
evaluation of drug distribution 
across relatively large areas. The 
lung has long been thought to be a 
“blood organ,” suggesting that the 
circulating concentration of drug 
in the bloodstream approximates 
that in TB lesions. Recent work 
with IMS in animal models of dis- 
ease clearly shows this assump- 
tion to be false in most cases (5, 
12); some drugs penetrate lesions 
poorly or not at all, whereas oth- 
ers freely diffuse into and out of 
affected areas. Better knowledge 
of the physicochemical proper- 
ties of molecules that favor lesion 
penetration to get to where the 
bacteria are will help to achieve 
successful shorter TB therapies. These sorts 
of tools may also find application in other 
diseases with potential latent phases associ- 
ated with distinct pathology. 

All these exciting techniques have tre- 
mendous potential to accelerate TB drug 
development by improving the quality 
of drugs and drug combinations used in 
clinical trials to target specific lesion types. 
Whole-body imaging can also affect the 
speed and accuracy of clinical trials by 
providing quantitative end points. For ex- 
ample, serial CT allows changes in bacterial 
load and hence in the volume of disease in 
specific lesion types to be measured. At the 
same time, changes in the extent of inflam- 
mation can be monitored with PET. These 
tools could be as transformative in TB clini- 
cal trials as viral load and CD4 counts were 
to HIV/AIDS. 
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INORGANIC CHEMISTRY 


A closer mimic of the oxygen evolution 
complex of photosystem II 


An inorganic cluster replicates many of the structural aspects of the complex that 
photosplits water and powers photosynthesis 


By Licheng Sun 


he oxygen evolution complex (OEC) 

in photosystem II (PSII) catalyzes the 

photosplitting of water. The resulting 

electrons and protons are then ulti- 

mately used to create adenosine tri- 

phosphate to convert carbon dioxide 
(CO,) into organic compounds. An artificial 
catalyst that mimics the small inorganic 
OEC cluster within the much larger PSII 
enzyme could be used to create fuels such 
as hydrogen from water via sunlight (7). Al- 
though tremendous efforts have been spent 
on artificial photosynthesis systems (2), syn- 
thetic water oxidation catalysts that closely 
mimic the structure and function of OEC in 
PSII have been very limited. Now, on page 
690 of this issue, Zhang et al. (3) describe the 
closest structural mimic of the OEC in PSII 
reported to date. 

The naturally occurring OEC contains 
a cluster of manganese, calcium, and oxy- 
gen, Mn,CaO,. Several crystal structures 
of PSII have been reported since 2001 (4, 
5), but these had insufficient resolution to 
elucidate the atomic structure of the OEC. 
In 2011, Shen and co-workers re- 
ported a crystal structure of PSII 
at a resolution of 1.90 A by x-ray 
diffraction using synchrotron ra- 
diation (6). Although much de- 
tailed structural information was 
revealed, extended x-ray absorp- 
tion fine structure (EXAFS) studies 
showed that the Mn cations in the 
OEC are easily reduced (7), leading 
to slight differences in the Mn-Mn 
distances determined by different 
techniques. 

Very recently, Shen and collabo- 
rators (8) used femtosecond x-ray 
pulses to obtain a high-resolution 
(1.95 A) “radiation damage-free” 
crystal structure of PSII in the dark- 
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“Making the synthetic 
models work as real water 
oxidation catalysts will 
require consideration of 
other structural features.” 


stable S1 state. In this Mn,CaO, cluster (see 
the figure, panel A), five oxo-bridged O at- 
oms form a twisted cubane with three Mn 
atoms and 1 Ca atom on the corners, and a 
fourth Mn atom outside the cubane held by 
two of the five oxo bridges. Because one of 
the oxo-bridged oxygens, O05, is much farther 
away from the Mn atoms than are the other 
oxo-oxygen atoms, the O5 is likely a hydrox- 
ide ion instead of a normal oxygen dianion. 
Thus, O5 may serve as one of the substrate 
O atoms and be involved in the O-O bond 
formation step (9, 10). 

Artificial molecular water oxidation cata- 
lysts based on other transition metal com- 
plexes [for example, ruthenium (JJ)] can 
be very active in chemical, electrochemical, 


Natural versus artificial 
The structure of the Mn,CaO, cluster in photosystem II (A) (6) and the structure of the synthetic 
Mn,CaO, cluster (B) prepared by the group of Zhang et al.; ‘Bu is tert-butyl and Me is methyl. 
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and even photochemical reactions, but effi- 
cient biomimetic water oxidation catalysts 
based on Mn complexes have so far shown 
very limited success. In 2011, Agapie and co- 
workers synthesized a Mn,CaO, cluster that 
structurally mimics the subsite of the OEC 
in PSII (12). However, the Mn,CaO, cluster 
that Zhang et al. have synthesized has the 
core cubane structure Mn,CaO, linked to a 
dangling Mn via one oxo bridge on the cu- 
bane (see the figure, panel B) and structur- 
ally mimics the full site of the OEC in PSII 
more closely. 

The x-ray crystal analysis of this syn- 
thetic Mn,CaO, cluster revealed structural 
similarities to the OEC of PSII. First, both 
structures have the core cubane Mn,CaO, 
and the dangling Mn atom. Second, for the 
first coordination sphere, both structures 
have one nitrogen-based ligand, with 6 car- 
boxylates in the OEC and 10 carboxylates 
in the synthetic model. Third, the valence 
states of the Mn ions—Mn (II), MncIv), 
Mn(IV), and Mn(II)—are identical in both 
structures. 

The main difference in structural fea- 
tures is that in the OEC of PSII, there are 
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two oxo bridges (O5 and O6 in the figure, 
panel A) between the dangling Mn and the 
core cubane, but in the synthetic cluster 
there is only one oxo bridge (O5’ in the fig- 
ure, panel B). The lack of the second oxo 
bridge (06°) might be a fatal weakness for 
this synthetic model to work as a real cata- 
lyst for water oxidation. In the OEC of PSII, 
the O-O bond formation involves 05, and 
the second oxo bridge O6 can keep the dan- 
gling Mn at the correct position and redox 
potential after O5 has left during the O-O 
bond formation. In contrast, the dangling 
Mn in the synthetic model might detach 
from the core cubane after O5’ has left dur- 
ing the O-O bond formation if it follows a 
similar reaction pathway, which would in- 
activate the catalyst. Another difference is 
the lack of a potentially open coordination 
site in the synthetic model because of the 
short bond length between O5’ and the Mn 
(2.28 A) on the corner of the core cubane, 
relative to the related bond length of 2.7 A 
in the OEC of PSII. 

The work of Zhang et al. constitutes an 
important step toward a full structural 
mimic of the OEC in PSII. Further improve- 
ment of synthetic model complexes must 
address the second oxo bridge between the 
dangling Mn and the core cubane. Making 
the synthetic models work as real water oxi- 
dation catalysts will require consideration 
of other structural features. For example, 
longer distances between the corner Mn(III) 
and O05’, and between O5’ and the dangling 
Mn(III), need to be enforced via rational li- 
gand design to provide a potentially open 
site for substrate water molecules to coordi- 
nate. This change also would allow the in- 
troduction of a pendent base in the second 
or third coordination sphere to facilitate the 
proton-coupled electron-transfer process 
during the O-O formation. & 
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NEURODEVELOPMENT 


“RASopathic” astrocytes 
constrain neural plasticity 


The cellular pathology of a complex neurodevelopmental 


disorder is teased apart 


By Lei Xing, Xiaoyan Li, William D. Snider 


ver the past decade, mutations in 

genes encoding RAS family members, 

other components of an intracellular 

signaling cascade that RAS controls, 

and proteins that modify the cascade 

have been recognized as causes of de- 
velopmental syndromes. Collectively, these 
syndromes are often referred to as “RASopa- 
thies.” Not surprisingly, RASopathies have 
numerous manifestations, including pro- 
pensity to cancer, craniofacial abnormalities, 
cardiac defects, cutaneous abnormalities, 
neurodevelopmental delay, and varying de- 
grees of cognitive dysfunction. Uncovering 
the causes and developing treatments for 
the neurodevelopmental abnormalities are 
a challenge because of the myriad cellular 
elements in the brain and the complexity of 
nervous system function. A recent study by 
Krencik et al. (1) takes a major step toward 
identifying the cellular pathology underly- 
ing Costello syndrome, a RASopathy that is 
characterized by delayed development, cra- 
niofacial and heart problems, and cognitive 
impairment. The latter appears to be linked 
to abnormal development and function of a 
population of nonneuronal cells (astrocytes) 
in the brain. 

The extracellular signal-regulated kinase/ 
mitogen-activated protein kinase (ERK/ 
MAPK) signaling cascade (also known as the 
RAS-RAF-MEK-ERK pathway) is among the 
most important cellular pathways, transduc- 
ing effects of external signals and regulating 
key cellular responses such as proliferation, 
differentiation, and morphological develop- 
ment. Interestingly, most RASopathies ex- 
hibit gain of function in ERK signaling with 
exact clinical manifestations varying with 


Altering neural plasticity. Astrocytes derived from 
fibroblasts of Costello syndrome patients harboring the 
HRAS@5 mutation exhibit premature differentiation, 
increased proliferation, and larger size compared to 
controls. They also express more extracellular matrix 
components, resulting in the formation of perineuronal 
nets around interneurons (in a mouse model of Costello 
syndrome). Premature formation of perineuronal nets 
may accelerate neuron maturity and “close” the critical 
period of plasticity. 
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the specific mutation (2). Dysregulation of 
ERK signaling in RASopathy mouse models 
leads to premature differentiation and over- 
production of astrocytes (3, 4). Astrocytes 
and other glial cells, including oligodendrog- 
lia, are thought to represent more than 50% 
of the cellular elements in the human brain 
and are increasingly recognized as having 
important roles in regulating the develop- 
ment of brain circuits (5, 6). 


Hypertrophic astrocytes / ¢ 
(mutant HRAS) AN 


Perineuronal 
net _ 


| Interneuron 


Premature closure of 
“critical period” for synaptic plasticity 
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Costello syndrome is caused by at least 
15 different mutations in the gene encoding 
the RAS family member HRAS (7). Disease- 
associated mutations cause constitutive acti- 
vation of this protein. Krencik et al. explored 
the function of one particular mutation, 
HRAS®”S (in which glycine at amino acid 12 
is changed to serine) in gliogenesis. From 
dermal fibroblasts derived from Costello syn- 
drome patients, the authors generated hu- 
man induced pluripotent stem cells (iPSCs). 
These iPSCs were then induced to give rise 
to neuroepithelial cells, the precursor to both 
neurons and glia. When these neuroepthelial 
cells were exposed to epidermal growth fac- 
tor and fibroblast growth factor 2 (FGF2), 
which act through the RAS-ERK signaling 
pathway, they expressed astrocytic markers 
indicative of glial differentiation. Fewer con- 
trol neuroepithelial cells expressed astrocytic 
markers, indicating that mutant HRAS leads 
to premature astrocyte differentiation. 

Astrocytes expressing HRAS°”S were ap- 
parently healthy, hyperproliferative, mark- 
edly larger in size than controls, and also 
displayed more elaborate neural processes. 
The hypertrophic nature was also observed 
when these astrocytes were cultured on 
mouse hippocampal organotypic slices (a 
more realistic environment than dissoci- 
ated cells), as well as in a mouse model in 
which a different Costello syndrome mutant, 
HRAS®”Y (where glycine at amino acid 12 is 
changed to valine), was expressed in astro- 
cytes. These findings are in line with a recent 
study in the fly Drosophila melanogaster 
showing that FGF signaling elicits similar 
morphological changes in astrocytes. Impor- 
tantly, astrocytes are closely associated with 
developing synapses (astrocytes can create a 
tripartite structure at neuronal synapses) (8). 
The implication is that hypertrophy of astro- 
cytes could produce an enlarged brain and 
might account for the macrocephaly seen in 
many Costello syndrome patients. 

Astrocytes derived from Rett syndrome 
and Down syndrome patients have been ob- 
served to have growth-inhibiting effects on 
developing neurons (9, 10). By contrast, as- 
trocytes expressing the mutant HRAS®°”S se- 
creted factors that were notably more potent 
than those released by control astrocytes in 
promoting neuronal process outgrowth and 
synaptic development (J). During normal 
mammalian brain development, astrocytes 
release constituents of the extracellular ma- 
trix that influence maturation of synapses 
and other neurodevelopmental processes. 
Consistent with premature astrocyte dif- 
ferentiation, transcripts and proteins of 
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Perineuronal net. A perineuronal net of extracellular 
matrix protein (magenta) surrounds an interneuron in the 
visual cortex of a mouse model of Costello syndrome. The 
elaborate processes of astrocytes are shown in black. 


extracellular matrix components (such as 
proteoglycans and fibrous collagens) were 
markedly increased in astrocytes derived 
from HRAS°”S iPSCs compared to controls. 
The expression of regulators and effectors of 
the RAS-ERK signaling pathway and regula- 
tors of synaptogenesis was also increased. 
The transcription factor SNAI2 is required 
for the increased expression of the extracel- 
lular matrix constituents. 

A critical function of glial-secreted pro- 
teoglycans is to form “perineuronal nets” 
that influence neural plasticity regulated by 
experience (JI) (see the first figure). They 
are found around neuronal cell bodies and 
neurites and stabilize established neuronal 
connections by restricting neuronal morpho- 
logical plasticity. Importantly, Krencik et al. 
found that higher amounts of perineuronal 
nets accumulate around inhibitory neurons 
expressing parvalbumin in the visual cortex 
of the HRAS°”’Y mouse model of Costello 
syndrome (compared to controls) at 14 days 
after birth. Parvalbumin-expressing neurons 
control the “critical period” in early brain de- 
velopment, a window of dynamic synaptic re- 
arrangement and change in synaptic efficacy 
(11). Accelerated deposition of perineuronal 
nets around these neurons could enhance 
their maturation, resulting in premature 
critical-period closure and altered neuronal 
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output (see the second figure). Indeed, the 
authors observed changes in synaptic po- 
tentials onto cortical excitatory neurons in 
the HRASS”Y mouse model at later develop- 
mental stages. Although this study did not 
investigate potential alterations of brain 
functions, it is plausible that disrupting syn- 
aptic development during the critical period 
may produce long-term effects on neuronal 
circuits. Overall, the results of Krencik et al. 
suggest that “RASopathic” astrocytes may 
have potent effects on neural plasticity that 
contribute to cognitive dysfunction. A vari- 
ety of standard paradigms could be used to 
test this idea, including formation of ocular 
dominance columns, tests of the physiologi- 
cal effects of monocular deprivation, and 
studies of patterning in the sensory cortex. 
The results of Krencik et al. are important 
not only for understanding pathogenesis, but 
also for developing effective treatments. Ow- 
ing to the importance of the RAS-ERK path- 
way in cancer, many inhibitors of pathway 
components have been developed for treat- 
ing various malignancies. In mouse models 
of RASopathies, MEK (MAPK kinase) inhibi- 
tors reversed cardiac defects (12) and some 
abnormalities of brain development, includ- 
ing increased gliogenesis (3). The use of RAS 
and MEK inhibitors in a Noonan syndrome 
mouse model demonstrated reversibility of 
synaptic dysfunction without obviously al- 
tering the behavior of control animals (13). 
However, there are many reasons for caution, 
including that RAS-ERK signaling regulates 
higher-order processes involved in learning 
(1/4), and that other signaling pathways are 
likely activated downstream of pathogenic 
RAS (15). The implication that there is pre- 
mature closure of critical periods in brain 
development due to hyperactive HRAS pro- 
vides another plasticity paradigm that might 
be further explored in preclinical studies. & 
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CONSERVATION 


Committing to ecological restoration 


Efforts around the globe need legal and policy clarification 


By Katharine Suding,™ Eric Higgs,” 
Margaret Palmer,’ J. Baird Callicott,* 
Christopher B. Anderson,> Matthew 
Baker,*® John J. Gutrich,’ 

Kelly L. Hondula,® Matthew C. LaFevor,® 
Brendon M. H. Larson,’ Alan Randall,’°™ 
J.B. Ruhl,” Katrina Z. S. Schwartz” 


t the September 2014 United Nations 

Climate Summit, governments ral- 

lied around an international agree- 

ment—the New York Declaration on 

Forests—that underscored restoration 

of degraded ecosystems as an auspi- 
cious solution to climate change. Ethiopia 
committed to restore more than one-sixth of 
its land. Uganda, the Democratic Republic of 
Congo, Guatemala, and Colombia pledged to 
restore huge areas within their borders. In 
total, parties committed to restore a stagger- 
ing 350 million hectares by 2030. 

The ambition affirms restoration’s grow- 
ing importance in environmental policy. 
These new commitments follow the 2010 
Aichi Convention on Biological Diversity (to 
restore at least 15% of degraded ecosystems 

globally) and the 2011 Bonn 
POLICY Challenge (to restore 150 mil- 

lion hectares). Particularly when 
accompanied by policies to reduce further 
losses (as in the New York Declaration), res- 
toration of such magnitude holds promise to 
address global environmental concerns. 

Achieving this promise requires careful 
thought about how we restore ecosystems 
(1, 2). We outline four core principles of 
scientifically based, workable, and compre- 
hensive restoration (3) that can provide ap- 
propriate best practice guidelines in legal, 
policy, and planning efforts. 

There is little question that ecological 
restoration can provide substantial benefits 
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Shits and engage? sO 
Four principles for planning restoration. The use 
of four principles identifies trade-offs in the planning 
process and the extent of departure from the full 
opportunities presented by comprehensive ecological 
restoration [example after (25)]. 


that enhance quality of life (4). A consider- 
able body of science suggests that restora- 
tion can guide establishment of complex 
self-sustaining interactions between biota, 
biophysical features, and processes that 
compose an ecosystem (5, 6). The science 
also emphasizes the challenging nature 
of the endeavor: Our interventions rarely 
achieve full recovery, and uncertainty is 
to be expected in dealing with natural re- 
covery processes (7, 8). Continuing envi- 
ronmental change further challenges the 
notion of recovery (9). 

Some have thus questioned whether dec- 
larations of intent to restore will in fact re- 
sult in substantive restoration of degraded 
land (0, 11). Others have cautioned that 
these declarations may spur actions that 
compromise biodiversity: for instance, by 
replacing ancient grassy biomes with forest 
plantations (72) or by planting species in 
climatic zones where they may not persist 
(13). Others emphasize that a focus on one 
specialized goal (e.g., climate change miti- 
gation) might not deliver intended benefits 
because of complexity in ecosystem dynam- 
ics in ways and over time scales not fully 
understood (14, 15). 

Specialized programs such as compensa- 
tory mitigation, endangered species conser- 
vation, and ecosystem service delivery can 
be a useful contribution to—but are not 
synonymous with—ecological restoration 
(16, 17). Such distinctions are not trivial be- 
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cause projects undertaken in the name of 
restoration may in fact be something differ- 
ent and, in many cases, have been demon- 
strated to achieve neither restoration nor 
their intended purposes (17, 18). Delivery 
of diverse benefits will depend on how on- 
the-ground efforts are conceived and imple- 
mented (7, 8). Avoiding mistakes on a grand 
scale requires clear practice principles (J0). 


FOUR PRINCIPLES. We advocate consider- 
ing four principles when planning restora- 
tion. The degree to which each principle 
is achievable will vary on the basis of so- 
cial and ecological context. By taking into 
account these comprehensive principles, 
trade-offs inherent in specialized projects 
are avoided, which increases the prospect 
of sustainable and valuable overall out- 
comes (see the figure). 

I. Restoration increases ecological integ- 
rity. Restoration initiates or accelerates 
recovery of degraded areas by prioritizing 
the complexity of biological assemblages, 
including species composition and repre- 
sentation of all functional groups, as well 
as the features and processes needed to sus- 
tain these biota and to support ecosystem 
function (3, 4). 

2. Restoration is sustainable in the long 
term. Restoration aims to establish systems 
that are self-sustaining and resilient; thus, 
they must be consistent with their environ- 
mental context and landscape setting. Once 
a restoration project is complete, the goal 
should be to minimize human intervention 
over the long term. When intervention is 
required, it should be to simulate natural 
processes that the landscape no longer pro- 
vides (e.g., fire or invasive species removal) 
or to support traditional practices of local 
communities (8, 9). 

3. Restoration is informed by the past 
and future. Historical knowledge, in its 
many forms, can indicate how ecosystems 
functioned in the past and can provide 
references for identifying potential future 
trajectories and measuring functional and 
compositional success of projects (19). 
However, the unprecedented pace and spa- 
tial extent of anthropogenic changes in the 
present era can create conditions that de- 
part strongly from historical trends (9). Of- 
ten, then, history serves less as a template 
and more as a guide for determining appro- 
priate restoration goals (19, 20). 
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4. Restoration benefits and engages soci- 
ety. Restoration focuses on recovering biodi- 
versity and supporting the intrinsic value of 
nature (27). It also provides a suite of eco- 
system services (e.g., improved water qual- 
ity, fertile and stable soils, drought and flood 
buffering, genetic diversity, and carbon se- 
questration) that enhance human quality of 
life (e.g., clean water, food security, enhanced 
health, and effective governance) (22). Resto- 
ration engages people through direct partici- 
pation and, thus, increases understanding of 
ecosystems and their benefits and strength- 
ens human communities (4). 

Parties to the U.N. Declaration will con- 
sider a variety of ways to achieve the new 
restoration commitments. We advocate 
adoption of all four principles as normative 
standards that assess intent at the planning 
stage, developed in conjunction with con- 
sideration of levels of uncertainty (in both 
the means and ends), the degree to which 
each principle could be attained, and legal 
or regulatory frameworks (23). Components 
that constitute ecological integrity will dif- 


“Our four principles provide 
a necessary foundation to 
achieve sustainability and 
resilience into the future.” 


fer across ecosystems; they will need to be 
described and made practical through best 
practice guidelines (4). The degree to which 
restoration can be self-sustaining will de- 
pend on landscape context; ongoing inter- 
ventions may be required in some cases to 
ensure ecological goals consistent with local 
context are met (6). Flexibility regarding the 
degree of historical fidelity will be needed 
to ensure success in rapidly changing envi- 
ronments (9). Ethical considerations can su- 
persede direct societal benefits, particularly 
when ecocentric ideals are followed (21). 

To achieve new restoration commitments, 
it will be tempting to consider specialized 
projects that emphasize one principle rather 
than attending to the full suite of potential 


opportunities. Degraded lands could be con- 
verted to carbon farms, where monocultures 
of fast-growing tree species are planted and 
managed to optimize carbon sequestration 
(24). Green infrastructure could provide 
vegetation that fixes carbon and increases 
permeable surfaces (25). As valuable as 
these strategies may be, they alone do not 
constitute comprehensive ecological resto- 
ration. To contribute to our commitment to 
restore, the scope of these strategies should 
be broadened to include all restoration prin- 
ciples (see the table). 

We urge parties to utilize all principles 
in their planning and to maintain a broad 
purpose. Although a comprehensive plan 
may require a more integrative approach 
than one aimed toward a specialized pur- 
pose, considering all four guiding princi- 
ples is most consistent with ecological and 
social science and most likely to realize ac- 
cepted benefits of restoration without net 
ecological loss. 

Our four principles provide a necessary 
foundation to achieve sustainability and 


Application of guiding principles for restoration 


Consideration of a comprehensive set of principles, with each principle situated along a continuum of effectiveness, should be a 
necessary provision of global restoration efforts. 


Four guiding principles 


ECOLOGICAL LONG-TERM BENEFITS AND INFORMED BY PAST 
EXAMPLES INTEGRITY SUSTAINABILITY ENGAGES SOCIETY AND FUTURE 
Mitigation Minimal Minimal Minimal Minimal 
Compensatory Project implementation Ongoing maintenance Net loss of aquatic Historical or reference 


mitigation for 
mountaintop mining 
impacts on streams, 
Appalachia, USA (26) 


most often based only 
on physical structure 


often required. Large 
changes in environmental 
context unaccounted for 


resources. Economic 
value of mining placed 
above environmental 
losses 


ecosystems evaluated by 
length or oversimplified 
stream “units” rather than 
functional metrics 


Ecosystem services 
Global Partnership on 
Forest and Landscape 
Restoration, Pamu 
Berekum, Ghana (27) 


Minimal 

Tree-planting focus, with 
little attention to diversity 
or other processes and 
functions. Planting targets 
in ha/year of plantations 


Moderate 

Increased rural livelihoods 
will decrease probability of 
unsustainable harvesting 


Exemplary 

Increased carbon 
sequestration and food 
production. Participatory 
planning. Better income 
and rural livelihoods 


Minimal 

Most focus on plantation 
methods, some attention 
to including valued 
indigenous tree species 


Urban greening 
Cheonggyecheon 
stream restoration, 
Seoul, South Korea 
(25) 


Moderate 

Increased biodiversity 
sixfold, including marsh 
plants, fish, and birds, but 
at considerable capital 
cost given central location 
in large urban region 


Minimal 

To keep river flowing, 
water must be pumped 
from Han River and 
underground reserves at a 
cost of >200 million yen 
per year 


Exemplary 

Reduced urban heat island 
effect and small-particle 
air pollution. Increased 
property values. Provided 
critical natural habitat for 
recreation in urban core 


Moderate 

Redirected underground 
waterways. Historically, an 
intermittent stream with 
strong cultural 
significance 


Endangered species 
El Segundo blue 
butterfly (Euphilotes 
battoides allyni), 
California, USA (28) 


Moderate 

Focus on host plant, coast 
buckwheat (Eriogonum 
parvifolium) 


Minimal 
Habitat protection and 
conservation 


Moderate 
Intrinsic value of species 
preservation 


Exemplary 

Preservation of last 
remaining coastal dunes in 
butterfly range 


Habitat restoration 
Postlogging stream 
restoration, Lyell 
Island, Gwaii Haanas, 
Haida Gwaii, British 
Columbia, Canada (4) 
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Exemplary 

Integrated approach to 
habitat restoration that 
focused on several 
aspects of diversity and 
function 


Exemplary 

Cessation of logging, 
relative absence of 
invasive species, cultural 
engagement, and assisted 
succession techniques 
ensure long-term success 
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Exemplary 

Significant cultural value 
in recovery of a focal 
group (salminoid fish), an 
important food source 
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Exemplary 

Clear use of intact 
contemporary reference 
ecosystems; goals 
included ecological and 
cultural continuity 
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resilience into the future. Ecosystems that 
are structurally and functionally diverse 
are more likely to be durable and capable 
of adapting to future challenges of climate 
change, introduced species, and land-use 
change and they can be sustained with a 
declining investment of human and finan- 
cial capital over time. Involving people 
through multiple avenues—from participa- 
tion to consumption of ecosystem services 
to cultural renewal—can promote public 
engagement and stewardship of local eco- 
systems. Adherence to these principles 
will add clarity, accountability, and accom- 
plishment in this new era of embracing 
ecological restoration as an environmental 
policy tool. & 
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GTEx detects genetic effects 


The genetic basis for variation among individuals 
in transcript abundance across tissues is analyzed 


By Greg Gibson 


ne of the lessons from the past several 

years of genomic analysis is that well- 

conceived, ambitious, and thought- 

fully analyzed genetic studies carried 

out by large consortia can advance 

the field in giant leaps. They do so 
both by providing new insight and by gener- 
ating data sets that are widely accessible to 
all investigators. It is thus remarkable that, 
even though we now know that the vast ma- 
jority of common polymorphisms (variants 
of a particular DNA sequence) that are as- 
sociated with disease risk act by modulating 
gene expression, “big science” transcription 
analyses have been lacking. This deficit is 
now addressed with the publication of the 
first results from the Genotype-Tissue Ex- 
pression (GTEx) Consortium (J), which also 
includes the findings of Melé et al. (2) and 
Rivas et al. (3), on pages 648, 660, and 666, 
respectively, in this issue. 

GTEx is an effort coordinated by the U.S. 
National Human Genome Research Institute 
to understand the genetic basis for variation 
among individuals in transcript abundance 
across many tissues (4). Hitherto, our knowl- 
edge of the genetics of gene expression in 
humans has derived mostly from studies of 
blood (5), lymphoblast cell lines (6), and iso- 
lated studies of accessible tissues such as fat 
or skin (7). The plan for GTEx is to associ- 
ate whole-genome sequence variation with 
RNA sequencing data for more than 50 tis- 
sue types from almost 1000 next-of-kin con- 
sented postmortem donors. This knowledge 
will provide direct evidence addressing the 
function of the many thousands of disease- 
associated variants supplied by genome-wide 
association studies (GWAS) and will illumi- 
nate mechanisms of variation for disease 
risk among healthy people. The pilot phase 
results (/-3) are based on data from the first 
237 donors, of whom around 100 have RNA 
samples analyzed in 9 tissues, with data 
from smaller subsets of donors available for 
33 other tissues. The main GTEx Consortium 
article reports on the genetic regulation of 
gene expression, whereas Melé et al. provide 
an overview of differences between the “tran- 
scriptome’—all RNA molecules, including 
messenger RNA, ribosomal RNA, transfer 
RNA, and other long noncoding RNA tran- 
scripts—across tissues and individuals. Rivas 
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et al. report on the effect that protein-trun- 
cating variants have on human transcrip- 
tion, generating a quantitative model of how 
nonsense-mediated decay (the elimination of 
transcripts that contain a premature stop co- 
don) varies across tissues and may be geneti- 
cally regulated. 

Previous studies in many organisms have 
established that common regulatory poly- 


“This knowledge will ... 
illuminate mechanisms of 
variation for disease risk...” 


morphisms (expression quantitative trait 
loci, or cis-eQTLs) located within a few hun- 
dred kilobases of a gene significantly influ- 
ence the expression of at least half of all 
genes in one tissue or another (8). They act 
locally to influence expression of a nearby 
gene, and may explain anywhere from a few 
percent to more than half the variance in 
abundance of the specific transcript among 
individuals. These effects are much larger 
than those typically associated with disease, 
so the largest eQTL effects can be detected 
with sample sizes of as few as 100 individu- 
als (9). It is to be expected that rare variants 
also contribute to disease, although their 
discovery is in its infancy. Epigenetic influ- 
ences such as chromatin modification and 
microRNA regulation certainly also explain 
substantial amounts of the variance. A criti- 
cal feature of transcriptional variation is the 
very high degree of co-regulation, sometimes 
of thousands of genes. This can be attributed 
to the collective effects of trans-acting regula- 
tory factors (transcription factors, hormones, 
environmental agents) as well as variation in 
the abundance of cell types within tissues. 
One of the major contributions of these 
first GTEx papers is quantification of the rel- 
ative contributions of cis-eQTLs in different 
tissues, suggesting (for example) that thy- 
roid and tibial nerve have twice the number 
of genes regulated by local polymorphisms 
than blood or heart (J). However, blood 
seems to have a relatively high level of allele- 
specific expression (transcription predomi- 
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nantly from one of the two chromosomes), 
whereas brain samples are depleted for this 
phenomenon. Interestingly, it appears that 
allele-specific expression is more strongly 
conserved among tissues within individuals 
than is overall transcript abundance (1). 
There has been much debate on how 
much shared genetic variation for gene ex- 
pression there is across tissues. Different 
analytical methods applied to comparisons 
of different data sets have led to diverse 
conclusions (7, 10). The GTEx Consortium 
tackles the issue by studying a common set 
of individuals with two different analytical 
approaches: pairwise linear models and 
joint Bayesian analysis. The conclusion is 
striking, namely that around half of all cis- 
eQTLs—particularly those proximal to a pro- 


of these are observed in multiple tissues, but 
not necessarily with the same expression 
profiles across tissues. It turns out that quite 
a few genes switch module “membership” 
between individuals (see the figure). These 
switches can be associated with local regula- 
tory variants (modQTLs), only about half of 
which were detected as cis-eQTLs in the rel- 
evant tissues (J). Because genes in modules 
often share regulatory motifs for a common 
set of transcription factors, and these motifs 
tend to harbor regulatory polymorphisms, 
the implication is that the full GTEx project 
will uncover new principles accounting for 
variation in the co-regulation of genes within 
and between tissues. 

What are the implications for person- 
alized medicine? These are barely hinted 
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Switches in modularity. Many genes tend to be coexpressed in modules. Modules may differ in their presence (gray) 
or size (yellow) among tissues and between individuals (blue). Some genes switch between modules [from the blue to 
yellow module in the brain (right); from the blue to gray module in the liver (left)]. 


moter—are active in the majority of tissues, 
whereas the other half tend to be specific to 
one or two tissues (J). Variants that affect 
splicing, namely the generation of alterna- 
tive transcript isoforms, also tend to be con- 
served across tissues with more than 80% 
detected in multiple tissues, but with a wide 
range of similarity among pairwise compari- 
sons: Whole blood shares fewer than 10% of 
its splicing QTLs with Sun-exposed skin, but 
almost 50% with the heart’s left ventricle. 
One of the big surprises reported (J) is 
the discovery of module QTLs GmodQTLs), 
which are regulatory variants that influence 
the co-regulation of gene expression. The 
idea is that most genes are organized into 
expression modules. Even though they are 
located on different chromosomes, they tend 
to be have similar expression levels (11). Of- 
ten, many genes in a module share a function 
such as controlling the cell division cycle. 
The reported analysis finds 117 modules of 
between 25 and 414 transcripts each. Some 
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at in the studies (1-3), but the most obvi- 
ous is validation of inferences from bio- 
informatics processing of GWAS data and 
information from the Encyclopedia of DNA 
Elements (ENCODE) project (which identi- 
fied functional elements in the human ge- 
nome sequence) (72). The measurement of 
chromatin features designed to annotate en- 
hancers and other regulatory elements has 
led to the realization that disease-associated 
variants tend to be enriched in the vicinity 
of genes that are more likely to be active in 
disease-relevant tissues, such as lymphoid 
cells in autoimmune disease or neurons in 
psychiatric disorders (13, 14). GTEx provides 
direct evidence that this is the case, and the 
project’s accompanying portal allows anyone 
to look up in which tissue a disease variant 
influences the expression of a nearby gene 
in a particular direction. Notably, very often, 
“nearby” does not mean the most adjacent. 
Additionally, the important point is made 
(1, 3) that profiling gene expression across 
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50 tissues demonstrates that genes encod- 
ing protein-truncating variants are actually 
not even expressed in the most relevant tis- 
sue. This implies that variants predicted to 
be deleterious on the basis of DNA sequence 
may actually be highly unlikely to contribute 
to a disease. 

Within the next 2 years, the full GTEx data 
should be available. There is no question that 
with an order of magnitude more data, the 
analyses will greatly exceed verification of the 
findings reported in the pilot studies. Vastly 
more cis-eQTLs will be found, intricacies of 
allele-specific expression and splicing will 
be worked out, and mechanisms responsible 
for switches in modularity inferred. We can 
also expect more complete integration of the 
GTEx data with the ENCODE analyses (13), 
using statistical approaches yet to be con- 
ceived, and a pilot ENCODE Tissue Expres- 
sion (EnTEx) project will report chromatin 
profiling of some GTEx samples. Such analy- 
ses will allow us to sift through the sugges- 
tive GWAS peaks and explain more of the 
variance for disease and attribute it to appro- 
priate cell types. Yet more ambitious GTEx 
projects might be conceived, evaluating how 
genetic regulatory effects vary in the context 
of disease (“GTEx-D”) and across environ- 
ments (“GTEx-E”), if investments are made 
in the research of genotype-tissue expression 
from patients who have chronic diseases or 
have lived with different lifestyles or environ- 
mental exposures such as toxins or severe so- 
cioeconomic stress (15). 

Perhaps most important, we can begin to 
outline an enhanced program of genome- 
enabled precision medicine. Although there 
is justifiable excitement about the ability of 
DNA sequencing to identify the causes of 
congenital abnormalities, to predict the pro- 
gression of tumors, and to personalize the 
prescription of drugs, the static genome has 
its limitations. If, 20 years from now, gene 
expression profiling is incorporated side-by- 
side with genotype analysis as a standard 
component of medical diagnostics, the GTEx 
project will be seen to have brought us closer 
to realization of this vision. ™ 
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Promises and perils 
for the panda 


AS A GLOBAL conservation icon and 
China’s national treasure, the endangered 
giant panda has received exceptional 
attention worldwide. Between 2003 and 
2013, giant pandas in the wild increased 
from 1596 to 1864, according to the results 
from the latest survey released by the State 
Forestry Administration (7). About two- 
thirds of the wild giant pandas and 53.8% of 
panda habitat are within nature reserves (2). 
Pandas in captivity are also thriving. 

However, there are long-term threats to 
panda survival and reproduction, including 
virus infection (“Captive pandas succumb 
to killer virus,’ M. Hvistendahl, In Depth, 
13 February, p. 700), habitat fragmentation, 
low genetic diversity, and human activities. 
As some old threats have diminished, new 
threats have emerged. Harvesting forests for 
fuelwood and timber has declined drasti- 
cally (2), but livestock grazing has increased 
(3). Although illegal poaching has dwindled, 
climate change is becoming a potential 
threat. For example, climate change could 
bring a substantial reduction in the distri- 
butional ranges of the dominant bamboo 
species that comprise almost the entire 
diet of the panda population in Qinling 
Mountains of Shaanxi Province (the region 
with the highest panda density across the 
panda range) (4). 

To ensure long-term panda survival, it 
is important to expand from short-term to 
long-term strategies from the perspective 
of coupled human and natural systems (5). 
For example, the State Forestry 
Administration plans to prohibit 
encroachments on existing 
reserves for economic develop- 
ment, but only for the next 2 
years. It is imperative to reduce 
human pressure in the long 
run by providing local residents 
with incentives to conserve pan- 
das and panda habitat (6) and 
by providing educational oppor- 
tunities that will enable young 
people living in the panda range 
to find jobs and settle in the cit- 
ies (7). Creating nature reserves 
in areas where no pandas cur- 
rently live (such as the northern 
side of the current panda range) 
might provide a safer haven for 
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NEXTGEN VOICES 


Postdocs 
reimagined: Last call 


You have one more week to respond to the 
NextGen VOICES survey! Share your thoughts 
about this question: 


Is the idea of the postdoc position obso- 
lete in today’s scientific landscape? If so, 
what should replace it? If not, what one 
change would you make to improve it? 


To submit, go to http://scim.ag/NG_15 


Deadline for submissions is 15 May. A selec- 
tion of the best responses will be published in 
the 3 July 2015 issue of Science. Submissions 
should be 200 words or less. Anonymous 
submissions will not be considered. 


future pandas as the climate changes. 

To respond to the Chinese government’s 
call for more panda conservation, detailed 
data from the panda surveys should be made 
readily available. Releasing the data can help 
to analyze how and why panda populations 
have changed in a particular region that was 
covered in different surveys. Sharing the data 
respects the hard work by more than 2000 
people surveying in the rugged mountains 
over 3 years in the latest panda survey alone. 
Most important, it can help boost promises 
and reduce perils for the panda. 

Jianguo Liu 
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The potential of 
secondary forests 


TROPICAL FORESTS ARE increasingly 
modified by human activities. Centuries 

of human-forest interactions have led to 

a diverse array of forest areas in different 
phases of succession. In recent decades, 
forest conversion to cattle pasture or agricul- 
tural fields, followed by land abandonment, 
has led to large areas of second-growth 
forest in the Amazon. These forests grow 
rapidly and sequester large amounts of 
carbon in their biomass, but they tend to 
be ignored, as most of the debate on the 
carbon balance of the Amazon basin tends 
to revolve around old-growth forests. 

For example, a recent study has shown 
that the net carbon uptake of Amazonian 
old-growth forests has declined by a 
third per decade from 1990 to 2010 (J, 

2). When extrapolated over the whole 
Amazon basin, these results translate into 
a reduced role of intact tropical forests in 
climate change mitigation. This alarming 
conclusion, however, completely ignores 
the important role of regenerating forests 
as carbon sinks. For instance, in 2010 
about 25% of formerly deforested areas 
in Para, Brazil, were occupied by second- 
growth forests. Although re-growing 
forests have lower carbon stocks (45 to 
48% of old growth forest), their net car- 
bon sequestration rate is up to 20 times 
higher (4.6 to 5.8 Mg carbon 
ha*™ year”) (3) than old-growth 
forests (7). Additionally, about 
one-quarter of the forests in 
the Amazon basin are man- 
aged for timber production. 
Net carbon sequestration 
rates after timber extraction 
are high, and the application 
of reduced-impact logging 
techniques further increases 
carbon sequestration rates (2.8 
Mg ha" year?! compared with 
0.5 for conventionally logged 
areas) (4). Consequently, it is 
essential to incorporate the 
carbon sequestration poten- 
tial of second-growth, logged, 
and managed forests in future 
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assessments of the Amazon basin as a 
global carbon sink. 
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Naming diseases: 
First do no harm 


IN RECENT YEARS, the world has seen the 
emergence of several new human infectious 
diseases. Given the rapid and global com- 
munication through social media and other 
electronic means, diseases are now often 
given common names by stakeholders outside 


as well as inside the scientific community. 
The use of names such as “swine influenza” 
and “Middle Eastern Respiratory Syndrome” 
has had unintentional negative economic and 
social impacts by stigmatizing certain indus- 
tries or communities. Disease names, once 
given, are difficult to change later even if an 
inappropriate name is being used. Therefore, 
it is important that an appropriate name is 
assigned to a newly identified human disease 
by whoever first reports it. 

In response to such concerns, the World 
Health Organization (WHO), in close collabo- 
ration with the World Organisation for Animal 
Health (OIE) and the Food and Agriculture 
Organization of the United Nations (FAO), 
and in consultation with the International 
Classification of Diseases (ICD) (J), has 
developed a set of standard best practices for 
naming new human infectious diseases, with 
the aim of minimizing unnecessary negative 
effects on nations, economies, people, and ani- 
mals. A full description of these best practices 
is available on the WHO Web site (2). 

These best practices apply to new 
infections, syndromes, and diseases of 
humans that have never been recognized 
or reported before in humans, that have 
potential public health impact, and for 
which no disease name is yet established 


in common usage. They do not replace the 
existing ICD system, but rather provide an 
interim solution prior to the assignment 
of a final ICD disease name. As these best 
practices only apply to disease names for 
common usage, they also do not affect the 
work of existing international authoritative 
bodies responsible for scientific taxonomy 
and nomenclature of microorganisms. 
WHO, OIE, and FAO strongly 
encourage all national, regional, and 
international stakeholders, including 
scientists, national authorities, and media, 
to follow these best practices in the event 
of the emergence of a new human disease, 
so that inappropriate disease names do not 
become established. 
K. Fukuda,'* R. Wang,’ B. Vallat’ 
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The whispering gallery at St. Paul’s 


PHYSICS 
Acircular route to confine electrons 


hysical barriers are used to confine waves. Whether it is 
harbor walls for sea waves, a glass disk for light, or the 
“whispering gallery” circular chamber walls in St. Paul's 
Cathedral for sound, the principle of confinement—reflec- 
tion—is the same. Zhao et al. used that same principle 
to confine electrons in a nanoscale circular cavity in graphene. 
Periodic patterns within the cavity were associated with an 
electronic wave version of whispering gallery modes. The 
tunability of the cavity size may provide a route for the manipu- 
lation of electrons in graphene and similar materials. — |SO 


Science, this issue p. 672 
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Molybdenum nanostructures 


convert natural gas 
Gao et al., p. 686 


HUMAN GENOMICS 
Expression, genetic 
variation, and tissues 


Human genomes show exten- 
sive genetic variation across 
individuals, but we have only 

just started documenting the 
effects of this variation on the 
regulation of gene expression. 
Furthermore, only a few tissues 
have been examined per genetic 
variant. In order to examine how 
genetic expression varies among 
tissues within individuals, the 
Genotype-Tissue Expression 
(GTEx) Consortium collected 
1641 postmortem samples 
covering 54 body sites from 175 
individuals. They identified quan- 
titative genetic traits that affect 
gene expression and determined 
which of these exhibit tissue-spe- 
cific expression patterns. Melé et 
al. measured how transcription 
varies among tissues, and Rivas 
et al. looked at how truncated 
protein variants affect expression 
across tissues. — LMZ 


Science, this issue p. 648, 
p. 660, p. 666; see also p.640 


GEOPHYSICS 
Silent slip events 


get shallow 


Clues to help better predict 

the likelinood of devastating 
earthquakes and tsunamis may 
be embedded in a more gentle 
type of rumbling. Using ocean- 
bottom seismometers, Yamashita 
et al. report rare observations of 
migrating tremors in the shallow 
part of a subduction zone off 
southern Kyushu, Japan. The 
tremors appear to be linked to a 
very low-frequency earthquake 
and seem to migrate to the 
region where big earthquakes 


Published by AAAS 


are generated. The tremors may 
be tracing how and where stress 
gets concentrated onto the 
earthquake-producing portion of 
the fault. — BG 


Science, this issue p. 676 


SURFACE SCIENCE 
Using friction to 
guide fabrication 


Ultralow friction found in 
certain layered materials such 
as graphite is important in the 
construction of nanomechanical 
devices. Koren et al. combined 
measurements and modeling 
to characterize the interaction 
of sliding graphite planes (see 
the Perspective by Liechti). This 
helped them to make small 
graphite devices that featured 
rotational pivots and multiple 
locking positions. — BG 

Science, this issue p.679 


BIOENGINEERING 
Keeping tabs 
on tiny worms 


Filarial nematodes—tiny para- 
sitic worms that can enter the 
bloodstream—are common in 
some parts of Africa. One of 
these worms, Loa loa, causes 
loiasis but is not compatible with 
current ivermectin-based mass 
drug administration (MDA) pro- 
grams. These programs aim to 
eliminate other worms that cause 
onchocerciasis and lymphatic 
filariasis. To pinpoint people to 
exclude from MDA, D'Ambrosio 
et al. devised a mobile phone- 
based strategy for quantifying 
Loa microfilariae in whole blood. 
A mobile phone video camera 
and a custom algorithm tracked 
the worm’s “wriggling” motion. 
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The device was packaged for 
point-of-care use—including its 
own smartphone “app” —and 
successfully diagnosed Loa- 
infected people in Cameroon, 
Africa. — MLF 

Sci. Transl. Med. 7, 286re4 (2015). 


An optogenetic tool 
to silence neurons 


Potassium channels in the cell 
membrane open and close in 
response to molecular signals 
to alter the local membrane 
potential. Cosentino et al. linked 
a light-responsive module to 
the pore of a potassium channel 
to build a genetically encoded 
channel called BLINK] that is 
closed in the dark and opens 
in response to low doses of 
blue light. Zebrafish embryos 
expressing BLINK1 in their neu- 
rons changed their behavior in 
response to blue light. — VV 
Science, this issue p. 707 


Extra dividends from 
measles vaccine 


Vaccination against measles has 
many benefits, not only lifelong 
protection against this poten- 
tially serious virus. Mina et al. 
analyzed data collected since 
mass vaccination began in high- 
income countries when measles 
was common. Measles vac- 
cination is associated with less 
mortality from other childhood 
infections. Measles is known to 
cause transient immunosuppres- 
sion, but close inspection of the 
mortality data suggests that it 
disables immune memory for 2 
to 3 years. Vaccination thus does 


Measles vaccination 
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more than safeguard children 
against measles; it also stops 
other infections taking advantage 
of measles-induced immune 
damage. — CA 


Science, this issue p. 694 


Inhibiting Hippo to 
mend broken hearts 


Activation of the Hippo signal- 
ing pathway prevents organ 
overgrowth. The pathway inhibits 
the activity of the transcriptional 
coactivator Yap, which is impor- 
tant during development. 
However, this same activity limits 
the ability of some organs to 
regenerate after injury. Morikawa 
et al. found that Yap target genes 
not only included cell cycle 
genes but also genes encoding 
cytoskeletal remodeling proteins 
or proteins that link the cytoskel- 
eton to the extracellular matrix. 
Cardiomyocytes from Hippo 
signaling—deficient mice formed 
cellular protrusions typical of 
migrating cells and more readily 
moved toward scar sites after 
cardiac injury. Thus, inhibiting the 
Hippo pathway could help with 
heart regeneration. — WW 

Sci. Signal. 8, ra41(2015). 


Away to dissect 
malaria’s secrets 


Malaria has exerted a strong 
selective force on the human 
genome. However, efforts to 
identify host susceptibility 
factors have been hindered 

by the absence of a nucleus 

in red blood cells. Egan et al. 
developed an approach 
involving blood stem cells 

to discover host 
factors critical for 
Plasmodium falci- 
parum infection of 
red blood cells. The 
authors identified 
an essential host 
receptor for parasite 
invasion that could 
provide a target 
for malaria thera- 
peutics. — SMH 


Science, this issue p. 711 
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Arole for macrophages in Rett syndrome 


cientists previously implicated microglia, a macrophage 
subset in the brain, in the pathogenesis of the neurode- 
generative disease Rett syndrome. To better understand 
how microglia and other types of macrophages might 
contribute to the development of the disease, Cronk et 
al. examined MECP2-deficient mice. Multiple types of macro- 
phages express MECP2 in normal mice, and several of these 
populations, including microglia, are lost in MECP2-deficient 
mice. MECP2 regulated a pro-inflammatory gene expres- 
sion signature in macrophages. When the authors selectively 
reexpressed MECP2 in macrophages, they were able to extend 
the lives of MECP2-deficient mice, which suggests that macro- 
phages probably contribute to disease pathogenesis. — KLM 


Methyl-C binding may 
explain Rett late onset 


Girls with Rett syndrome 
develop normally for the first 1 
or 2 years of life before suffering 
from progressive neurological 
problems, perhaps due to muta- 
tions in methyl-CpG—binding 
protein 2 (MeCP2). Chen et al. 
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Immunity 42, 679 (2015). 


analyzed the binding of MeCP2 
to genomic DNA in the mouse 
brain and found that before birth, 
MeCP2 binds predominantly to 
methylated CG sequences. After 
birth, there is an increase in the 
genome of methylated C ina 
non-CG context (mCH). MeCP2 
binds to many of these mCH 
sites, which are enriched in genes 
with neuronal functions. MeCP2 
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RESEARCH | IN OTHER JOURNALS 


ECOLOGY 


Thirty new fly species in Los Angeles 


rban environments are not usually considered to be rich in biodiversity; surveys of urban 
biodiversity often report limited and declining numbers of species. Hartop et al. show that 
they can nevertheless yield surprises, at least when it comes to insect diversity. Studying 
thousands of samples from private backyards in Los Angeles, CA, the authors have discov- 
ered 30 previously unknown species of Megaselia flies. Possible reasons for this unexpected 
diversity may be the wide range of geography in the Los Angeles area and the introduction of 
invasive species via its large container ports. It remains unclear, however, whether this diversity 
represents native or invasive species. — JFU 


Six of 30 new fly 
species discovered 
in Los Angeles 


binding modulates the transcrip- 
tion of these genes, some of 
which are implicated in Rett syn- 
drome, potentially explaining the 
late onset of the disease. — GR 


Proc. Natl. Acad. Sci. U.S.A. doi/ 
10.1073/pnas.1505909112 (2015). 


NEUROSCIENCE 
Sleeping while awake 


Sleep deprivation affects our 
behavior and performance. 
Bernardi et al. demonstrate 

the connection between task- 
specific performance decrease 
and local sleep in relevant parts 
of the human brain. During 24 
hours of wakefulness, individuals 
participated in driving simulations 
and executive function exer- 
cises. Their task-related abilities, 
such as visuomotor control and 
response inhibition, were tested 
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Zootaxa 10.11646/zootaxa.3941.4.1 (2015). 


alongside electroencephalog- 
raphy (EEG) recordings and 
functional magnetic resonance 
imaging (fMRI). Local EEG theta 
waves, normally observed during 
sleep, coincided with times of 
slower movements, visual inac- 
curacies, and decreased impulse 
control. The fMRI scans exposed 
cognitive fatigue in the form of 
regional neuronal disconnections 
in the task-relevant brain areas in 
addition to the general deficien- 
cies. — MSM 

J. Neurosci. 35, 4487 (2015) 


PHYSICS 


Tilting the field 
to expose a fraction 


In extremely clean, flat, and cold 
conductors, turning on an exter- 
nal magnetic field can produce 


the so-called quantum Hall 
effect (QHE), with the voltage 
across the sample exhibiting 
plateaus. The plateaus reflect 
the discrete energy levels of 
electrons in a magnetic field and 
can sometimes appear when 
only part of the level is filled: 
the fractional QHE. The most 
exotic of these states have filling 
factors with even denominators 
and may be useful in topological 
quantum computing. Falson et 
al. observed even-denominator 
states with filling factors 3/2 
and 7/2 at the interface of 
MgZnO and ZnO. Interestingly, 
one of the states was only 
observable when the research- 
ers tilted the magnetic field 
from the normal to the sample. 
—JS 


Nat. Phys. 11, 347 (2015). 
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MEDICINAL CHEMISTRY 
Proteins take steps 
to bind larger ligands 


In medicinal chemistry, opti- 
mizing the binding of a small 
molecule to a protein target 
often involves incremental 
changes to the small molecule. 
The design process may involve 
simulations of how a series of 
molecules binds to the target, 
but there is little experimental 
data to inform this analysis. 
Merski et a/. determined the 
structures of eight benzene 
compounds, each differing by 

a single methyl! group, bind- 

ing to the model protein T4 
lysozyme. Instead of gradually 
changing structure, T4 lysozyme 
transitions between three con- 
formations with different-sized 
ligand-binding sites. A search of 
the protein data bank revealed 
other examples of discrete 
changes in response to ligand 
size. Modeling such discrete 
conformations may be impor- 
tant in drug design. — VV 


Proc. Natl. Acad. Sci. U.S.A. 10.1073/ 
pnas.1500806112 (2015). 


FLOW CHEMISTRY 
Round and round with 
a precious catalyst 


When manufacturing phar- 
maceuticals, it is crucial to 
make just one of two possible 
mirror-image structures. Soluble 
precious metal catalysts can 
facilitate that objective, but 
they're often hard to collect and 
reuse after the reaction is over. 
O'Neal et al. present a labora- 
tory-scale flow and filtration 
protocol to keep a ruthenium 
hydrogenation catalyst in 
continuous use for 24 hours. A 
membrane incorporated into 
the flow apparatus retains the 
catalyst in the reaction loop 
while releasing the product, all 
the while withstanding hydrogen 
pressure and a strongly basic 
co-catalyst. Less than 200 parts 
per billion of ruthenium escapes 
into the product stream. — JSY 
ACS Catal. 5, 2615 (2015). 
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RESEARCH 


Soil and human security in the 


21st century 


Ronald Amundson,* Asmeret Asefaw Berhe, Jan W. Hopmans, Carolyn Olson, 


A. Ester Sztein, Donald L. Sparks 


BACKGROUND: Earth’s soil has formed by 
processes that have maintained a persistent 
and expansive global soil mantle, one that in 
turn provided the stage for the evolution of the 
vast diversity of life on land. The underlying 
stability of soil systems is controlled by their 
inherent balance between inputs and losses of 
nutrients and carbon. Human exploitation of 
these soil resources, beginning a few thousand 
years ago, allowed agriculture to become an 
enormous success. The vastness of the planet 
and its soil resources allowed agriculture to 
expand, with growing populations, or to move, 
when soil resources were depleted. However, 
the practice of farming greatly accelerated 
rates of erosion relative to soil production, and 
soil has been and continues to be lost at rates 
that are orders of magnitude greater than mech- 
anisms that replenish soil. Additionally, agri- 
cultural practices greatly altered natural soil 
carbon balances and feedbacks. Cultivation thus 
began an ongoing slow ignition of Earth’s largest 
surficial reservoir of carbon—one that, when com- 


bined with the anthropogenic warming of many 
biomes, is capable of driving large positive 
feedbacks that will further increase the accu- 
mulation of atmospheric greenhouse gases and 
exacerbate associated climate change. 


ADVANCES: The study of soil is now the do- 
main of diverse schools of physical and bio- 
logical science. Rapid advances in empirical 
and theoretical understanding of soil processes 
are occurring. These advances have brought an 
international, and global, perspective to the study 
of soil processes and focused the implications 
of soil stewardship for societal well-being. Major 
advances in the past decade include our first 
quantitative understanding of the natural rates 
of soil production, derived from isotopic meth- 
ods developed by collaboration of geochemists 
and geomorphologists. Proliferation of research 
by soil and ecological scientists in the northern 
latitudes continues to illuminate and improve 
estimates of the magnitude of soil carbon 
storage in these regions and its sensitivity and 


response to warming. The role of soil pro- 
cesses in global carbon and climate models is 
entering a period of growing attention and in- 
creasing maturity. These activities in turn re- 
veal the severity of soil-related issues at stake 
for the remainder of this century—the need to 
rapidly regain a balance to the physical and 
biological processes that drive and maintain 
soil properties, and the societal implications 
that will result if we do not. 


OUTLOOK: Both great challenges and oppor- 
tunities exist in regards to maintaining soil’s 
role in food, climate, and human security. Ero- 
sion continues to exceed natural rates of soil 
renewal even in highly developed countries. 
The recent focus by economists and natural sci- 
entists on potential future shortages of phos- 
phorus fertilizer offers opportunities for novel 
partnerships to develop efficient methods of 
nutrient recycling and redistribution systems 

in urban settings. Possi- 


bly the most challenging 
issues will be to better un- 
derstand the magnitude 
of global soil carbon feed- 


backs to climate change 
and to mitigating climate 
change in a timely fashion. The net results of 
human impacts on soil resources this cen- 
tury will be global in scale and will have di- 
rect impacts on human security for centuries 
to come. 


The list of author affiliations is available in the full article online. 
*Corresponding author. E-mail: earthy@berkeley.edu 

Cite this article as R. Amundson et al., Science 348, 1261071 
(2015). DOI:10.1126/science.1261071 
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Large-scale erosion forming a gully system in the watershed of Lake Bogoria, Kenya. Accelerated soil erosion here is due to both overgrazing and 
improper agricultural management, which are partially due to political-social impacts of past colonization and inadequate resources and infrastructure. 
The erosion additionally affects the long-term future of Lake Bogoria because of rapid sedimentation. This example illustrates the disruption of the 
natural balance of soil production and erosion over geological time scales by human activity and the rapidity of the consequences of this imbalance. 
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RESEARCH 


SOIL SCIENCE 


Soil and human security in the 


21st century 


Ronald Amundson,'* Asmeret Asefaw Berhe,” Jan W. Hopmans,* Carolyn Olson,* 


A. Ester Sztein,” Donald L. Sparks® 


Human security has and will continue to rely on Earth’s diverse soil resources. Yet we have 
now exploited the planet’s most productive soils. Soil erosion greatly exceeds rates of 
production in many agricultural regions. Nitrogen produced by fossil fuel and geological 
reservoirs of other fertilizers are headed toward possible scarcity, increased cost, and/or 
geopolitical conflict. Climate change is accelerating the microbial release of greenhouse 
gases from soil organic matter and will likely play a large role in our near-term climate 
future. In this Review, we highlight challenges facing Earth’s soil resources in the coming 
century. The direct and indirect response of soils to past and future human activities will 
play a major role in human prosperity and survival. 


oil is the living epidermis of the planet (7). 
Globally, soil is the medium through which 
a number of atmospheric gases are biolog- 
ically cycled and through which waters are 
filtered and stored as they pass through 
the global hydrological cycle (2). Soil is a large 
and dynamic reservoir of carbon and the phys- 
ical substrate for most of our food production. 
Profound changes are on the horizon for these 
interconnected functions—particularly sparked 
by changes to climate and food production—that 
will likely reverberate through society this cen- 
tury. Ultimately, the way in which we directly and 
indirectly manage our planet’s soil will be inter- 
woven within our future success as a species. 
Soil is commonly thought of as the ~1-m-thick 
layer of biogeochemically altered rock or sediment 
at Earth’s surface that has acquired numerous 
qualities during its exposure to the atmosphere that 
greatly distinguish it from its geological sources (3). 
Soil-forming chemical reactions create micrometer- 
sized electrically negative clay minerals that im- 
part soil with plant nutrient retention capabilities 
(4). The electrical charge characteristics of soil, 
combined with its small particle size and high 
surface area, allow it to temporarily store rain and 
snow melt for plant use and provide sufficient 
residence time for a multitude of chemical reac- 
tions to occur that may remove or reduce the 
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toxicity of contaminants. The water stored in 
soil—termed green water (5)—serves as the source 
for 90% of the world’s agricultural production 
and represents ~65% of global fresh water (5). 
Last, the intimate intermingling of life—plant, 
animal, and microbial—within the soil matrix 
drives redox reactions that control many elemen- 
tal cycles (6) and creates a reservoir of organic 
C that greatly exceeds the C in the global atmo- 
sphere and biosphere (7). The microbial com- 
munities that mediate these redox reactions are 
now believed to represent much of Earth’s total 
biodiversity (8), but the nature, function, and 
economic potential of this soil biosphere is only 
beginning to be probed (6). 

Soil, due to global variations in climate, geol- 
ogy, and biota (3), has tremendous spatial diver- 
sity. More than 20,000 soil types (or soil series) 
have been identified and mapped in the United 
States alone (9), and the number identified in- 
creases as land area investigated increases. If the 
soil series—to-land area relationship (J0) is extrap- 
olated to global ice-free land area, the results 
suggest that there are more than 300,000 series 
on the planet. The response of these soils to per- 
turbations can be extremely varied because of their 
diverse chemical, physical, and biological char- 
acteristics, suggesting the importance, as a sim- 
ple precautionary principle, of maintaining segments 
of this diversity for the stability and resilience of 
global biogeochemical systems in the face of an- 
thropogenic disturbances. 


Human Imprint on Soil 


Humans altered the ecosystems they encoun- 
tered as they began their spread across the globe. 
However, the most momentous development in 
human landscape change occurred with the in- 
vention and adoption of agriculture (77). Most 
agricultural practices involve the removal of the 
natural flora, the simplification of biodiversity to 
favor monocultures, and the physical disruption 


of the soil. Since the Industrial Revolution, ex- 
panding populations have relied on the exploi- 
tation of more and more soil for a corresponding 
growth in food production. Today, ~12% of ice- 
free land is in cropland, and 38% is used for com- 
bined cropping and grazing (12), an area roughly 
equivalent to the land area covered by ice and 
scoured or otherwise disturbed during the last 
glacial maximum (Fig. 1A). In addition to the sim- 
ilarity in area, the agricultural impact on soil 
processes rivals or exceeds the effect of those ice 
sheets in both rapidity and magnitude. 

Undisturbed soils have the characteristic, as 
result of a number of feedback mechanisms, of 
being able to retain many of their features indef- 
initely over time—their thickness, C content, and 
nutrients, for example—a condition that is equat- 
able to sustainability (Fig. 2). Cultivated soils are 
highly modified forms of their wild predecessors 
and may thus be viewed as domesticated soils 
(9). One key characteristic is that domesticated 
soils seldom are able to maintain the qualities of 
their original conditions, and these changes 
greatly affect their productivity and their impact 
on surrounding geochemical cycles. The efforts 
to improve the management and conservation of 
these domesticated soils, and the preservation of 
portions of their remaining wild ancestral stock, 
will be among the most important challenges this 
century (9, 13). Analyses of the combined agricul- 
tural and urban impact on soil series in the United 
States, for example, revealed large areas in the 
agricultural heartland where more than 50% of 
the soil series had been domesticated. Soil diver- 
sity, like biodiversity (74), provides an array of 
human-valued goods and services. Among the most 
apparent issues is the ability of soil to provide 
sustained agricultural production. 

The domesticated soil landscape is one of Earth’s 
most valuable commodities. For example, nearly 
$3816 billion (U.S. dollars) in agricultural pro- 
ducts were produced globally in 2012 (15). How- 
ever, agriculture is competing with increasing 
urban and suburban soil demands. The conver- 
sion of soil to urban land is largely irreversible on 
human time scales. There is uncertainty both in 
the present and the future distribution of urban 
land on Earth (Fig. 1B). A recent meta-analysis 
suggests that between 1970 and 2000, an area 
greater than the size of Denmark was urbanized, 
and that in the next 20 years, 1.5 million km? of 
land (the size of Mongolia) will be urbanized (6). 
The conversion of farmland to urban areas must 
be weighed against the fact that our most pro- 
ductive soils have already been exploited and 
that demand for food production will continue 
to increase. 


Soil and Climate Security 


A relatively stable climate has been the stage 
on which the great human inventions of agri- 
culture and industrialization have evolved, and 
direct or indirect human impacts on soil C cy- 
cling processes will have much to do with atmo- 
spheric greenhouse gas concentrations and the 
associated climate implications by the end of this 
century. 
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Organic C stored in soil is the balance between 
plant inputs and microbially mediated metabolic 
losses as CO, (Fig. 2). In unperturbed conditions, 
soils achieve steady-state C pools on time scales 
of centuries to a few millennia. The total store of 
soil organic carbon is still uncertain, but recent 
estimates suggest pools on the order of 2300 giga- 
tons (Gt) in the upper 3 m (7). Soil cultivation 
and clearing has caused a major fraction of total 
anthropogenic greenhouse gas emissions since the 
19th century (77). Cultivation is a major disruption 
to the natural C balance in soil, one that alters 
the physical and biological structure of soil, effec- 
tively igniting, through microbially mediated pro- 
cesses, a vast store of labile C that has accumulated 
over millennia (8). During the first few decades 
that soil is cultivated, up to 50% of the carbon 
pool is oxidized to CO,; eventually, a quasi-steady- 
state soil C pool is achieved (19). Based on the 
global agricultural land area, cultivation has like- 
ly released between 50 and 70 Gt of C to the at- 
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humid climates (59). 


mosphere over the course of human history (20), 
and the combined cultivation and biomass burn- 
ing contributions to atmospheric CO, exceeded 
that of fossil fuel emissions well into the 20th 
century (17). However, the agricultural imprint 
on atmospheric greenhouse gas concentrations 
appeared much earlier in the Holocene (27). Early 
spikes in atmospheric CO, and CH, corresponded 
to agricultural expansion in Mesopotamia and in 
China (22). Much of the historical C loss was from 
the soils of forests and grasslands of the northern 
latitudes. However, today the locus of land alteration 
has shifted to the equatorial latitudes, and up to 
10% of global anthropogenic CO, emissions are from 
a combination of biomass burning and soil culti- 
vation in the humid and subhumid tropics (23). 

Under changed management or through land 
abandonment, global agricultural soils have the 
capacity to reapproach their original C storage 
and regain up to a half a decade of present fossil 
fuel emissions (over a multidecade period). Bet- 
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Fig. 1. The global patterns of soil properties and the human imprint on 
them. Expanding urban centers (55) remove soil from other uses, whereas 
the creation of cropland (56) is one of the major drivers of imbalances in the 
soil carbon cycle (57) and accelerated rates of soil erosion (58). Phosphorus, a 
critical plant nutrient, is unevenly distributed and inherently low in warm and 


ter stewardship of domesticated soils that leads 
to higher organic matter contents is a valuable 
practice from an ecological perspective and from 
an agronomic point of view (24). There is now a 
large body of research on the rates of C seques- 
tration under differing management practices. 
However, there are limits to these practices as a 
means of mitigating continued fossil fuel emis- 
sions. First, a serious concern with management- 
based soil C sequestration strategies is that they 
are dependent on restricted management op- 
tions in a highly decentralized and economically 
driven agricultural sector (25). A change in land 
ownership, or a change in factors driving agricul- 
tural practices, can rapidly release much of regained 
C. Second, the effectiveness of soil C sequestration 
is time dependent. For example, if all potential 
soil sequestration strategies were established, 
they would initially serve as a sink of about 1.3 Gt 
of C year” (Table 1) (26), but this sink term 
would be expected to decline nonlinearly to low 
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sequestration rates over a period of several dec- 
ades as a new soil C steady state is reached (27). 
Equally as important is the difficulty of actually 
achieving this maximum potential, which involves 
multiple governments and millions of individual 
land managers. Last, soil management effects on 
the global C balance are inherently small relative 
to the climate-driven changes to soil C storage 
that will occur in this century. 

Soil C storage is well documented to decline 
with increasing temperature and decreasing soil 
moisture, and soil C storage patterns mirror glob- 
al climate zones (28, 29), with secondary impacts 
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by bedrock, topography, and soil age (20). Soil C 
pools are the balance between plant inputs and 
microbial decomposition (Fig. 3), and the responses 
of these processes to anthropogenic climate change 
are considered to be large, (in the case of inputs) 
poorly constrained (30), and complicated by tem- 
perature and moisture interactions. Anthropo- 
genic increases in atmospheric CO, may drive 
increased net primary production (NPP) as long 
as nutrient and water limitations do not occur 
(31), which ultimately may have a negative feed- 
back on atmospheric CO, through increased in- 
puts to soil C (Fig. 3). On the other hand, increasing 
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Fig. 2. Changes in the balance of important soil processes caused by human disturbance. Many 
soil characteristics are the balance or the result of a number of processes that respond to changes in 
environmental variables (3). However, properties such as hillslope soil thickness, organic carbon storage, 
N content, and other features attain steady state in intervals of a few centuries to millennia and appear 
capable of regaining stability. Human intervention in soil processes many times exceeds natural per- 
turbations and thus exceeds the resiliency of soil to recover to its original condition. Viewed broadly, 
steady state is a quantitative measure of soil sustainability. 


Table 1. Published estimates of soil C sources and sinks for the 21st century. 


Maximum Flux Cumulative Flux 


Management Type (Gt of C year) Reference (Gt) 
Carbon Sinks 

Increased net primary production (31) 160 to 1230 

Erosion 0.40 (37) 40 
Management 

Cropland management 0.36 (26, 27) 

Grazing land management 0.37 (26, 27) 

Restore degraded land 0.18 (26, 27) 

Restore organic soils 0.36 (26, 27) 

Total for management 1.26 16.4* 

Carbon Sources 

Land clearingt 250 

Soil warming 

Boreal regions (32) 50-270 

Globe (34) 104-629 

Total 

Net balance -188 to +137 


*Based on maximum new cropland by 2050 (10 billion ha) (68) and assumed loss of 25% of an average C 
content of 10 kg m™~@ (29). tCalculated assuming exponential decline to an ultimate landscape 
saturation after 50 years. 
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air temperatures warm soil, melt permafrost, and 
stimulate biological metabolism of soil carbon 
pools, driving what appears to be a large positive 
feedback process (32) (Figs. 1C and 3). Based on 
current earth system modeling, additions of soil 
C by increased NPP (relative to an 1850 BCE ref- 
erence date) are projected to be between 160 and 
1230 Gt by 2100, whereas C losses by increased 
decomposition are projected to be between 104 
and 629 Gt (31). Overall, models suggest net soil 
C changes from a loss of 72 Gt to gains of 253 Gt 
by 2100 (37). However, such exercises include great 
uncertainties in both projected gains (by CO,- 
enhanced photosynthesis) and losses (by soil warm- 
ing) and in assumptions about long-term ecosystem 
response to ever-increasing CO. concentrations. 
One important uncertainty is the response of 
northern latitude soils to warming, which could 
result in net soil C losses between 50 to 150 Gt 
(32, 33). Last, the current generation of earth 
system models has difficulty in matching present- 
day soil carbon storage patterns (34), and tuning 
the models is challenged by empirical uncertain- 
ties in the global soil C pool of more than 770 Gt 
(34), an uncertainty similar in size to the present 
atmospheric C pool. 

Still debated is the impact of soil erosion on 
the global C cycle. When agricultural soil is lost 
by water or wind erosion, the surficial, and most 
C-rich, material is preferentially removed, which 
accelerates the decline in the soil C pool. Rates of 
soil C replacement by crops and plants are rapid 
enough in certain situations to maintain soil C 
levels at a steady state under the condition of 
constant erosion—e.g., creating an ongoing sink 
(35). This sink represents a net reduction in 
atmospheric CO, only if the eroded C is not re- 
oxidized. Because some depositional environments 
are conducive to partial preservation of buried 
C dakes, reservoirs, basins, floodplains), the net 
effect of accelerated agricultural erosion was first 
suggested to be a global C sink of 0.6 to 1.5 Gt 
year ', a rate similar to the total global land sink 
(35). If the eroded C is largely oxidized, however, 
it may result in no net sink (or possibly even a 
net source) (36). The most recent estimates sug- 
gest that agricultural erosion of soil C may be 
0.40 +0.20 Gt of C year‘ (37). If we benefit from 
an unintended C sink due to soil erosion, any 
benefits must be clearly balanced against the 
related losses of nutrients and reduction of en- 
vironmental quality that require fossil fuel energy 
to remediate (38). 

The global soil C cycle has been greatly per- 
turbed by human activity, both directly through 
farming and indirectly through anthropogenic 
climate change. All projected soil C gains and 
losses this century are highly uncertain because 
of economic, population, and political influences 
(which will largely affect carbon sequestration 
efforts) and uncertainties in the magnitude of the 
soil response to warming (because of the com- 
plexity of the soil C pool structure) (Fig. 3). Hu- 
man changes to the global atmosphere and 
climate are likely to simultaneously drive both 
very large gains and losses of soil C—fluxes that 
are equivalent to decades of emissions at present 
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Fig. 3. A cause-effect diagram of the major soil-atmospheric CO2 feed- 
back processes. The values in parentheses, in the circles, are the approximate 
pool sizes (Gt) of C. A solid arrow represents a direct response (e.g., as CO 
increases, temperature increases); an arrow with a circle indicates an inverse 
response (e.g., as temperature increases, soil carbon storage decreases). Dashed 
lines are for processes that are less well understood. (A) Environments with 
adequate moisture. (B) Water-limited environments. In (A), the a-b-c loop is a 
positive feedback process (even number of inverse relationship arrows), one 
especially important in regions of melting permafrost. The d-f-c loop is a negative 
feedback, one with less certain feedbacks between vegetation and soil (f-g loop) 
and temperature (h-i). The strength of the a-b-c- versus d-f-c loops on soil carbon 
pools will likely determine whether soil carbon losses in northern latitudes serve 


rates of fossil fuel consumption. The presently 120 
unknown balance, and most importantly its sign, 
between the large fluxes represent considerable 
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as a major source of COz and CH, this century, a balance that also hinges on the 
ability of the soils to supply nutrients to plants (arrow g) in order to respond to the 
increases in CO> (arrow d). In (B), regions with limited moisture, the strengths of 
vegetation response to COz (d) and soil carbon response to temperature (b) may 
be weakened (thinner arrow). In addition, the vegetation response to increasing 
temperature may become negative. These figures reveal the importance of soil 
carbon to the global COz balance this century, as well as the uncertainties in the 
strength and direction of important processes. Arrow references are as follows: a, 
(60); b, (32, 33); c, COz loss by respiration is the overwhelming pathway of C 
removal from soils; d, water efficiency response in (61); f, soil C is the balance 
between plant inputs and decomposition losses; g, not well constrained, but see 
discussion in (62); h, (63); i, e.g., (64). 
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uncertainty for climate security (Table 1). 


@ Anydrous ammonia 


Soil and Food Security 


The late 20th and early 21st centuries have been, 


for industrialized countries, an unprecedented 
era of increasingly low food prices (39). There are 
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(40); and competition for soil by nonagricultural 
uses, such as biofuels or urbanization. Abundant 
energy has been the key driving force behind our ) — 
ability to maintain food production apace with 
an expanding population that is estimated to reach 
11 billion by 2100 (42). Low-cost energy, which 
led to advanced agricultural machinery replacing 
human labor, is causing migration to urban cen- 
ters. Energy is used to replace the soil nutrients 
removed or lost by agricultural perturbations of 
soil. Energy transforms atmospheric N, to the 
bioavailable NH, fertilizer through the energy- 
intensive Haber-Bosch process—constituting the 


straining the supplies. 
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Fig. 4. The post—World War II rise in fertilizer production and cost coincides with the spike in 
global human populations. The growth in world population in the late 20th century (65) mirrors the 
increasing use of industrially derived N fertilizer. Before N fertilization, for major U.S. crops (66), wheat and 
other grain yields per acre increased only following World War II, coincident with the rise in the use of N 
fertilizer produced by the Haber-Bosch process. Prices have sharply risen (67) because global demand is 


first and most important green revolution (42) 
(Fig. 4A), one that allows us to feed the increas- 
ing global population (Fig. 4A). Before the indus- 
trial fixation of N, any increase in food production 
for a given country was largely due to increased 
soil used for production (43), and only after the 
advent of N fertilizer (Fig. 4A) did yields per area 
of major crops begin their upward trajectory (43). 
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Last, energy is essential to mine and transport 
essential plant nutrients, such as P and K, that can 
only be accessed from limited geological reservoirs. 

Agricultural soil erosion is one of the most de- 
structive human perturbations to soil sustain- 
ability. Given little opportunity or desirability for 
further agricultural expansion, stewardship of our 
existing domesticated soil is essential for sus- 


tained human prosperity. Yet despite the impor- 
tance of soil conservation, the implementation of 
practices to minimize soil erosion has not fol- 
lowed apace with the severity of the problem. The 
most pervasive mechanism of soil erosion is via 
water. Before European contact and the removal 
of native vegetation by plowing and cultivation, 
the geological mechanism of soil erosion on most 
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uplands was by slow, biologically driven creep 
(44). The removal of plant cover allows the mecha- 
nism of erosion to change to raindrop dislodg- 
ment of soil particles and their subsequent removal 
by overland flow, which is a far more rapid erosional 
process. For example, recent analyses of U.S. 
erosion rates before European contact place the 
average rate at about 21 m My” (My, million years) 
(45). Today, erosion rates in the central United 
States can exceed 2000 m My’ (45), whereas the 
rates of soil erosion in portions of the loess pla- 
teau of China approach 10,000 m My ™ (46) (Fig. 
1D). This eroded sediment is ultimately replaced 
by the conversion of the underlying sediment or 
rock into new soils with the addition of organic 
matter and nutrients through biological mecha- 
nisms. Until the past decade, the pace of this re- 
placement process was poorly known, and the 
acceptable rates of soil erosion on agricultural 
lands were placed at 400 m My™ or more (47). Nu- 
merous studies of natural rates of soil production 
now suggest rates between 50 to 200 m My for 
many environments, indicating that the pace of 
erosion in numerous agricultural areas is, or un- 
til recently has been, unsustainable (47). Not only 
does the loss of soil remove nutrients from the 
site of agricultural production (38), but the sedi- 
ment generated adversely affects local drainages, 
water bodies, and regional aquatic ecology. Last, 
the maintenance (or even the improvement) of 
agricultural production in the face of accelerated 
rates of soil erosion is energy intensive. Although 
microbial symbiosis with plants can fix atmospheric 
N to bioavailable forms and can substitute for N 
fixed by the Haber-Bosch process, there is no bi- 
ological or atmospheric source for rock-derived 
nutrients, such as P, K, and Ca. 

Although natural processes of soil production 
and formation replace or release nutrients, the 
paces of these processes are slow relative to our 
anthropogenic use rate (Fig. 2). The transport of 
crops from the site of production to other loca- 
tions remove plant-essential nutrients from the 
soil, potentially causing deficiencies that limit 
potential production levels (48). This further 
drives a dependence on the mining and distri- 
bution of macronutrients from geological sources, 
which can create economic inequalities or geo- 
political conflicts between nations (49) (Fig. 1E). 
The growing demand for P has recently caused 
an increase in the cost of rock phosphate from 
about $80 per U.S. ton in 1961 to up to $450 per 
ton in 2008. Prices since then have fluctuated 
but are now at about $700 per ton (50) (Fig. 4). 
In addition to cost is the related issue of access. 
Morocco is estimated to have the world’s largest 
geological P reserves, much of it in disputed ter- 
ritory (49). The United States, on the other hand, 
has only about 2% of global P resources (57). At 
current rates of retrieval, the most productive 
mine in the United States will be depleted in 
20 years (49), which will force it to become in- 
creasingly reliant on imports to sustain its agri- 
cultural and industrial sectors. 

Because most other P-reliant countries lack the 
geological resources to indefinitely sustain cur- 
rent use, the only means to confront the decline 
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in reserves (other than conversion from domestic 
to imported P) is to develop a more coherent and 
integrated program of P (and other nutrient) re- 
cycling. The loss of nutrients in our human and 
animal waste streams is environmentally damag- 
ing and economically problematic. Regaining con- 
trol of these resources, now largely considered 
waste, would go far toward substantially lower- 
ing the demand for imported nutrients and other 
resources (52). In addition to P, other soil nu- 
trients appear to be entering periods of limitation 
or high demand (Fig. 1F). For example, K (potash) 
prices were ~$875 per metric ton in 2009 and are 
expected to reach $1500 by 2020. 


The 21st Century Challenge 


Humans have domesticated our soil resources 
and the planet (12, 53). This domestication has 
in turn perturbed a number of soil cycles such 
that they are no longer in balance, and the im- 
balance is changing soil in ways that will affect 
future generations and their climate (Fig. 2). Soil 
management must be geared toward passing a 
habitable, albeit highly altered, landscape to the 
generations that follow—one where our exploita- 
tion of, and impacts on, soil resources is adjusted 
to the pace of our planet’s renewal. These strat- 
egies should focus on regaining a balance in (i) 
organic C inputs and losses, (ii) soil erosion and 
production, and (iii) release and loss of nutri- 
ents. Soil sustainability—based on quantitative 
principles and measurements of soil erosion and 
production, soil nutrient loss and release, and soil 
carbon loss and return—must be the ultimate goal 
for managing the global soil resource and should 
serve as the driving principle for soil research 
that will support this management. 

These are challenging goals that will be diffi- 
cult to achieve. The solutions will require an ef- 
fort commensurate with the magnitude of the 
problems. First, effective solutions to soil sustain- 
ability, much like the approaches required to con- 
tend with climate change (54), must involve highly 
multidisciplinary research in novel intellectual 
settings or institutions. Second, the ultimate suc- 
cess of any innovation requires a dialog and in- 
terface with policy makers and public institutions, 
the ultimate “deciders” in broad-scale social change. 
These linked efforts will depend on continued, 
and arguably much greater, investments in knowl- 
edge and innovative knowledge transfer and simply 
different ways of conceptualizing and approaching 
problems. From our vantage point, the future of 
Earth’s soil resources is tenuously in our control 
or within our ability to sustain it into the future. 
Only those on Earth in 2100 will know how well 
we succeeded. 
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HUMAN GENOMICS 


The Genotype-Tissue Expression 
(GTEx) pilot analysis: Multitissue 
gene regulation in humans 


The GTEx Consortium*t{ 


Understanding the functional consequences of genetic variation, and how it affects 
complex human disease and quantitative traits, remains a critical challenge for 
biomedicine. We present an analysis of RNA sequencing data from 1641 samples 

across 43 tissues from 175 individuals, generated as part of the pilot phase of the 
Genotype-Tissue Expression (GTEx) project. We describe the landscape of gene 
expression across tissues, catalog thousands of tissue-specific and shared regulatory 
expression quantitative trait loci (eQTL) variants, describe complex network relationships, 
and identify signals from genome-wide association studies explained by eQTLs. 

These findings provide a systematic understanding of the cellular and biological 
consequences of human genetic variation and of the heterogeneity of such effects 


among a diverse set of human tissues. 


ver the past decade, there has been a marked 

increase in our understanding of the role 

of genetic variation in complex traits and 

human disease, especially via genome- 

wide association studies (GWAS) that have 
cataloged thousands of common genetic var- 
jants affecting human diseases and other traits 
(1-3). However, the molecular mechanisms by 
which this genetic variation predisposes in- 
dividuals to disease are still poorly character- 
ized, impeding the development of therapeutic 
interventions. 

The majority of GWAS variants are noncoding, 
likely manifesting their effects via the regulation 
of gene expression (4, 5). Thus, characterization 
of the regulatory architecture of the human ge- 
nome is essential, not only for understanding ba- 
sic biology but also for interpreting GWAS loci. 
Expression quantitative trait locus (eQTL) anal- 
ysis (6-8) is the most common approach used to 
dissect the effects of genetic variation on gene 
expression. However, comprehensive eQTL data 
from a range of human tissues are lacking, and 
eQTL databases are biased toward the most 
accessible tissues. Additionally, although many 
regulatory regions act in a tissue-specific man- 
ner (9, 10), it is unknown whether genetic var- 
jants in regulatory regions have tissue-specific 
effects as well. Complex diseases are often caused 
by the dysfunction of multiple tissues or cell 
types, such as pancreatic islets, adipose, and 
skeletal muscle for type 2 diabetes (JI, 12), so it 
is not obvious a priori what the causal tissue(s) 


*A list of authors and affiliations appears at the end of the 
paper. 
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are for any given GWAS locus or disease. Hence, 
understanding the role of regulatory variants, 
and the tissues in which they act, is essential for 
the functional interpretation of GWAS loci and 
insights into disease etiology. The Genotype- 
Tissue Expression (GTEx) Project was designed 
to address this limitation by establishing a 
sample and data resource to enable studies of 
the relationship among genetic variation, gene 
expression, and other molecular phenotypes 
in multiple human tissues (13). To facilitate the 
collection of multiple different tissues per do- 
nor, the project obtains recently deceased do- 
nors through consented next-of-kin donation, 
from organ donation and rapid autopsy set- 
tings. The results described here were gener- 
ated during the project’s pilot phase, prior to 
scaling up collection to 900 donors. All project 
data are made available at regular intervals 
to qualified researchers through dbGaP. Sum- 
mary data are available on the GTEx Portal 
(http://gtexportal.org). 


Study design 


During the pilot, we recruited 237 postmortem 
donors, collecting an average of 28 tissue sam- 
ples per donor spanning 54 distinct body sites 
(fig. S1 and tables S1 and S2). Blood-derived DNA 
samples were genotyped at approximately 4.3 mil- 
lion sites, with additional variants imputed using 
the 1000 Genomes phase IJ, resulting in ~6.8 mil- 
lion single-nucleotide polymorphisms (SNPs) with 
minor allele frequency (MAF) of =5% after qual- 
ity control (tables S3 to S5) (14). 

RNA was extracted from all tissues, but quality 
varied widely, with tissue site and sample spe- 
cific ischemic time accounting for ~40% of the 
variance in RNA quality (fig. $2). To maximize 


statistical power, we prioritized RNA sequencing 
of samples from nine tissues that were most fre- 
quently collected and that routinely met mini- 
mum RNA quality criteria: adipose (subcutaneous), 
tibial artery, heart (left ventricle), lung, muscle 
(skeletal), tibial nerve, skin (Sun-exposed), thyroid, 
and whole blood (Table 1) (74). 

We performed 76-base pair (bp) paired-end 
mRNA sequencing on a total of 1749 samples, of 
which 1641 samples from 43 sites, and 175 do- 
nors, constituted our final “pilot data freeze” 
reported on here (74). Median sequencing depth 
was 82.1 million mapped reads per sample (fig. 
S3A). The final data freeze included samples from 
43 body sites: 29 solid-organ tissues, 11 brain sub- 
regions (with two duplicated regions), a whole- 
blood sample, and two cell lines derived from 
donor blood [EBV-transformed lymphoblastoid 
cell lines (LCLs)] and skin samples (cultured fi- 
broblasts) (Table 1 and tables S1 and S2). Median 
sample size for the nine high-priority tissues was 
105; median sample size for the other 34 sampled 
sites was 18.5. 


Gene expression across tissues 


We examined the patterns of expression of 53,934 
transcribed genes across tissues [on the basis of 
Gencode V12 annotations] (74, 15). The number of 
biotypes [protein-coding genes, pseudogenes, and 
long noncoding RNAs (incRNAs)] that were tran- 
scribed above a minimal threshold [reads per kilo- 
base per million mapped reads (RPKM) > 0.1] was 
similar for most tissues (average of 20,940) (fig. 
S3B). Testis was an outlier, with substantially more 
transcribed regions detected than other tissues 
(31,240 on average), many of which are IncRNAs. 
Whole blood was also an outlier, exhibiting the 
fewest detected transcribed regions (17,160 on 
average). 

Hierarchical clustering demonstrated that 
expression profiles accurately recapitulate tis- 
sue type, with blood samples forming the pri- 
mary outgroup (Fig. 1A). The multiple brain 
regions cluster strongly together as a single 
unit, but among those the 11 individual sub- 
sampled regions are less distinct (Fig. 1A and 
fig. S4A). The most distinct brain region is the 
cerebellum (fig. S4A) (J6), with preservation 
method having little impact on that signal (fig. 
S4B). The distribution of gene expression across 
tissues is described by Melé et al. (17), who show 
that tissue-specific transcription is typically dom- 
inated by a few genes that vary from tissue to 
tissue. 

We quantified splicing events (splice junc- 
tions, exons, transcripts) by estimating exon 
inclusion levels, measured as PSI (percent spliced 
in) scores (14, 18). Clustering samples by PSI 
scores also largely, although less clearly, recapi- 
tulates tissue type. Samples from the brain, not 
blood, form the primary outgroup (Fig. 1B), 
which is divided into two groups: A group of 
227 samples (from the cerebellum and cortex) 
forms an independent subcluster (cluster 1), and 
a smaller group of 97 samples (cluster 2, dom- 
inated by the remaining subregions) clusters 
closer to samples from the rest of the tissues. 
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Table 1. GTEx pilot samples. Characteristics of the 1641 RNA-sequenced samples included in the pilot data freeze. The second and third columns show the 
tissue abbreviation and color assigned to each tissue (used throughout). The nine tissues prioritized for sequencing are indicated by red stars. Boxes highlight two 
regions of the brain that were sampled in duplicate. A region each from the cerebellum and cortex (BRNCHA and BRNCTXA) was sampled on site during initial 
tissue collection, and again after the brain was received by the brain bank (BRNCHB and BRNCT XB). Cell lines included are an EBV-transformed lymphoblastoid 
cell line from blood (LCL) and a cultured primary fibroblast cell line from fresh skin (FIBRBLS). RIN, RNA integrity number. 


Tissue Site Detail 


Hr Adipose - Subcutaneous 
Adipose - Visceral (OQmentum) 
Adrenal Gland 
Artery - Aorta 
Artery - Coronary 

Artery - Tibial 
Brain - Amygdala 


Brain - Anterior cingulate cortex (BA24) 


Brain - Caudate (basal ganglia 


Brain - Cerebellum [PAXgene] 
Brain - Cerebellar Hemisphere [Frozen 


Brain - Cortex [PAXgene] 


Sample Ischemic 


RIN Time (min) Age 
Abbreviation Color n Mean SD Mean SD Mean 
ADPSBQ | 94 69 06 421.9 318.3 48.2 
ADPVSC ig 7.2 06 401.4 303.1 46.9 
ADRNLG | 8.3 0.8 172.8 107.2 51.4 
ARTAORT (24 7.8 0.8 262.0 178.3 50.8 
ARTCRN io 75 O07 312.4 307.9 55.3 
ARTTBL | ss 7.5 0.7 486.7 350.0 47.9 
BRNAMY 23 69 860.8 NA NA 51.2 
BRNACC es si7 7.0 0.7 NA NA 51.4 


BRNCDT 


BRNCTXA 


NA 
300.9 
NA 
280.3 


52.5 
51.6 
49.7 
51.3 


868.9 


837.2 


Brain - Frontal Cortex (BA9) [Frozen] BRNCTXB NA NA 55.1 
Brain - Hippocampus BRNHPP 6.9 0. NA NA 51.1 
Brain - Hypothalamus BRNHPT | 7.0 0.9 NA NA 51.4 
Brain - Nucleus accumbens (basal 
ganglia) BRNNCC aa 28 74 06 NA NA 53.4 
Brain - Putamen (basal ganglia) BRNPTM 20 7.3 0.9 NA NA 49.5 
Brain - Spinal cord (cervical c-1) BRNSPC Sic 7A 0.7 NA NA 52.5 
Brain - Substantia nigra BRNSNG | 6.8 0.7 NA NA 53.8 
Breast - Mammary Tissue BREAST 27 7.0 07 645.6 425.4 50.3 
Cells - EBV-transformed lymphocytes LCL | sO) 9.9 0.2 59.7 502.0 46.2 
Cells - Transformed fibroblasts FIBRBLS 14 9.5 0.5 544.5 478.4 49.2 
Colon - Transverse CLNTRN 12 75 0.8 236.5 137.0 46.3 
Esophagus - Mucosa ESPMCS Sis 8.6 0.7 330.9 219.6 51.7 
Esophagus - Muscularis ESPMSL fee 20 7.9 0.6 310.6 273.3 48.5 
Fallopian Tube FLLPNT 1 6.1 NA 520.0 NA 51.0 
Heart - Atrial Appendage HRTAA 25 75 07 492.3 323.9 51.0 
IK Heart - Left Ventricle HRTLV 83 79 09 380.9 334.4 48.0 
Kidney - Cortex KDNCTX 3 68 04 583.0 585.0 56.3 
Liver LIVER 5 76 09 365.2 321.8 42.8 
Hung LUNG si i9 76 80.9 447.0 380.4 48.9 
Muscle - Skeletal MSCLSK | sR 7.9 06 486.1 358.1 49.2 
Nerve - Tibial NERVET cs 7.0 0.7 463.8 315.6 49.4 
Ovary OVARY 6 7.3 07 401.0 248.4 43.5 
Pancreas PNCREAS [9 19 6.8 0.7 200.4 115.3 49.3 
Pituitary PTTARY 13 73 ©6006 841.3 305.5 51.9 
Prostate PRSTTE 9 68 O07 231.1 84.0 50.2 
Skin - Not Sun Exposed (Suprapubic) SKINNS 23 7.2 0.7 557.2 388.2 48.7 
> Skin - Sun Exposed (Lower leg) SKINS ns soos 7.0 07 498.5 364.0 49.0 
Stomach STMACH | 12 74 0.9 250.0 131.6 47.8 
Testis TESTIS 14 7.0 09 293.6 236.3 52.4 
$e Thyroid THYROID S105 7.0 0.7 428.5 391.7 49.2 
Uterus UTERUS 7 74 1.0 313.3 184.3 48.7 
Vagina VAGINA 6 78 1.1 414.5 234.4 54.0 
¥* Whole Blood WHLBLD Ey) stse 8.1 0.8 238.4 498.2 49.7 
All 1641 7.5 0.9 418.9 394.2 50.2 


This is consistent with isoform regulation play- 
ing a comparatively larger role in defining 
cellular specificity in the brain (/8, 19). These 
analyses are extended in Melé et ail. (17) to define 
tissue-specific splicing signatures and to inves- 
tigate in depth the role of individual variation 
in splicing. 


eQTL analyses: Single-tissue eQTL analysis 


A primary goal of the GTEx project is to iden- 
tify eQTLs for all genes for a range of human 


tissues. Past studies, hampered by the difficul- 
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ties of obtaining human tissue samples, have 
typically examined no more than three tis- 
sues (8, 20). Although our pilot sample sizes 
are modest for eQTL discovery, the breadth 
of tissues provides an opportunity to assess 
differential eQTL discovery among tissues. Be- 
cause of our small sample sizes, we primarily 
examined eQTLs that act in cis to the gene (cis- 
eQTLs; see box S1), as the expected effect size of 
trans-eQTLs (box S1) is too low to be efficiently 
detected at this time. We calculated cis-eQTLs 
separately for each of the nine tissues with 


sufficient sample sizes (>80 donors) for all 
SNPs within +1 Mb of the transcriptional start 
site (TSS) of each gene (/4). Significance correla- 
tions between genotypes and gene expression 
levels were determined by linear regression on 
quantile normalized gene-level expression val- 
ues, after correction for known and inferred 
technical covariates (fig. S8) (14), using Matrix 
eQTL (21). To obtain gene-specific significance 
levels while correcting for testing multiple SNPs 
per gene, we computed permutation-adjusted 
P values for each gene for the most significant 
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SNP per gene (14). We defined “eGenes” as genes 
with at least one SNP in cis significantly asso- 
ciated, at a false discovery rate (FDR) of <0.05, 
with expression differences of that gene (box S1) 
(14). A list of the significant SNP-gene pairs de- 
tected per eGene can be found on the GTEx Por- 
tal (http://gtexportal.org). 

The number of eGenes ranged from 919 in heart 
to 2244 in thyroid, with a total of 6486 unique 
eGenes across the nine tissues (Fig. 2A). Rerun- 
ning the analysis on successively downsampled 
donor subsets from each tissue showed an ap- 
proximately linear relationship between eGenes 
and sample size (slope of ~21 eGenes per sam- 
ple; Fig. 2A). Interestingly, thyroid and nerve 
share a steeper slope with ~29 significant eGenes 
per sample, whereas muscle and blood share a 
shallower trajectory with ~15 eGenes per sample, 
which may reflect the lower transcript complex- 
ity observed for these two tissues (17). The 
number of eGenes identified showed no signs 
of plateauing at current sample sizes in any of 
the tissues. 

Consistent with previous work (20, 22), the ma- 
jority of the significant cis-eQTLs clustered around 
the TSS of target genes in all nine tissues (fig. S9, 


A Gene expression clustering 


B Exon (PSI) clustering 


A and B, and fig. S10). The eQTL signals also 
tended to show an upstream bias (fig. SIO and 
tables S6 and S7); an average of ~80% of signif- 
icant eQTLs fell within +100 kb around the TSS, 
and ~60% of all eQTLs were upstream despite 
testing a similar fraction of SNPs upstream and 
downstream of the TSS (tables S6 and $7) (23). A 
slight but distinct overrepresentation of nonsig- 
nificant eQTLs around the TSS (relative to other 
SNPs near the gene; red versus black line in figs. 
S9A and S11A) supports the existence of additional 
eGenes, which did not meet significance with cur- 
rent sample sizes. 

To investigate the sensitivity and validity of 
our study, in particular because we used tis- 
sues from deceased donors, we compared the 
GTEx blood eQTLs to a previously reported 
eQTL study of whole blood samples in ~5300 
individuals (7). Although many experimental 
and processing differences exist between these 
studies, a considerable fraction of GTEx eGenes 
(68%) were replicated in this study at FDR < 5% 
(14). Given the incomplete overlap in variants 
tested, we also compared our eQTLs against a 
smaller study of 911 blood samples taken from 
the Estonian Biobank, where we were able to 


apply a similar eQTL analysis pipeline to that 
used in GTEx and hence get better SNP cov- 
erage (fig. S11) (14). Notably, 98% of GTEx blood 
eQTLs at FDR < 5% showed consistent allelic 
direction with those eQTLs (P < 10°, binomial 
test) (14). 


Multitissue joint discovery of eQTLs: 
Tissue specificity and sharing of eQTLs 


The specificity or sharing of eQTLs among dif- 
ferent tissues and cell types is of considerable 
biological interest (8, 22, 24), yielding insights 
into differential genetic regulation among tissues. 
Furthermore, cross-referencing tissue-specific 
eQTLs with disease genetic associations could 
help identify tissues most relevant to disease 
biology. We used the GTEx pilot data to exam- 
ine eQTL sharing across multiple tissues and 
leveraged the large tissue range to discover weak 
but constitutively active eQTLs. 

We investigated patterns of eQTL sharing 
using 22,286 genes (with RPKM > 0.1 in at 
least 10 samples) for each of the nine tissues 
with both a simple non-model-based analysis 
of every pair of tissues (22) and more sophis- 
ticated Bayesian models for joint analysis of 


1 re — oe z i = Oi il SD ie ~ —— 


Cluster 1 ICluster 2 
1 

Brain subregions (@@ Frontal cortex (BA9) All tissues (main tissues in bold) 
Ml Amygdala lig Hippocampus ™) Adipose tissue § Colon Mm Lung Prostate 
8 Anterior cingulate cortex (BA24) Milli Hypothalamus M8 Adrenal gland {i Esophagus ll Muscle Reproductive organs 
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Fig. 1. Sample clustering based on gene expression and exon splicing profiles. (A) Clustering performed on the basis of gene expression values for all genes 


from Gencode v12 annotation. Tissue type is the primary driver of expression di 


ifferences, with the nonsolid tissues (blood and LCL cell lines) clustering separately 


from solid tissues. Hierarchical clustering was performed using as distance = 1 — Pearson correlation, and average method. (B) Sample clustering based on the 
“percent spliced in” (PSI) values for exons across samples. Tissue differentiation is less clearly a driver, and brain is now the main outgroup, driven largely by a 


cluster comprised of cerebellum and cortex samples. 
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all nine tissues (24, 25). Analyses focused on a 
+100-kb window surrounding the TSS for each 
gene, which is smaller than used for the single- 
tissue analysis, as this is where we observe the 
highest eQTL density. 

The non-model-based pairwise analysis meth- 
od identifies significant SNP-gene pairs in a 
first tissue, and then uses the distribution of 
the P values for these pairs in the second tis- 
sue to estimate 7, the proportion of non-null 
associations in the second tissue (22, 26). Esti- 
mated values of m, ranged from 0.54 to 0.90 
(Fig. 2B). For every pair of tissues, we observed a 
high level of sharing of eQTLs, and this pairwise 
sharing of eQTL among tissues roughly mir- 
rored the tissue gene expression correlations 
(fig. S12). 

To assess patterns of sharing beyond tissue 
pairs, we applied two Bayesian methods that 
assess, for each SNP-gene pair, the evidence 
for each of the 512 (2°) possible null/alternative 
eQTL configurations. The first method (24) is 
“gene-based” and assumes a single causal eQTL 
per gene. We extended the model to (i) sup- 
port the fact that not all tissues were sampled 
across all GTEx donors; (ii) calculate a gene- 
level FDR without requiring permutations (27); 
and (iii) include a fine mapping approach across 
multiple tissues (28). We also used a “SNP-based” 
approach (25), which assesses association of 
each SNP-gene pair separately, working directly 
with the g-statistics for each SNP-gene pair 
within each tissue [see also (29)]. We summa- 
rized the estimates of tissue specificity using 
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marginal posterior probabilities for the num- 
ber of tissues in which a randomly selected gene 
(gene-based model) or SNP-gene pair (SNP-based 
model) is active (Fig. 2C). Both approaches show 
a U-shaped pattern, with high tissue specific- 
ity (activity in a single tissue) or tissue ubiq- 
uity (activity in all nine tissues) more common 
than profiles involving only a few tissues, de- 
spite many more possible combinatorial pat- 
terns for intermediate specificity. Notably, both 
methods indicate that more than 50% of all 
detected eQTLs are common to all nine tissues. 
Reassuringly, both Bayesian methods produce 
pairwise tissue sharing probabilities that show 
agreement with the non-model-based analysis 
(fig. $13). Figure S14 illustrates the value of the 
multitissue analysis for an example in which the 
tissue specificity of an eQTL supports NDRG4 
as a candidate to influence QT interval in the 
heart (30). Despite superficial similarity between 
patterns of tissue-tissue gene expression and 
eQTL sharing, extensive comparisons of eQTL 
evidence to expression levels indicate that the 
tissue-specific patterns are only modestly cor- 
related with average tissue expression levels 
(fig. S15). 

We also examined effect size estimates for 
eQTLs that are shared between tissues. The 
vast majority of shared eQTLs show consist- 
ent effect directions in different tissues. How- 
ever, some SNPs showed apparent opposite 
effect directions in different tissues (fig. S16). 
A number of examples that we investigated using 
multi-SNP multitissue analyses (28) appeared 
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to be due to pairs of SNPs, in LD with one an- 
other, that were separate eQTLs in different 
tissues (fig. S17) rather than being a single eQTL 
with opposite effects. 

Beyond the biological interest of eQTL shar- 
ing among tissues, the availability of multiple 
tissues can also increase the power to detect 
eQTLs active in multiple tissues by combining 
information across tissues (24). To investigate, 
we conducted a permutation analysis by hold- 
ing the expression data fixed and permuting 
the genotypes. In this manner, we identified the 
significant eGenes per tissue (as in the single- 
tissue analysis above, but for a smaller window 
near the TSS), or jointly for any combination 
of tissues by considering the minimum P val- 
ue across tissues under each permutation. With 
this approach, the number of eGenes identi- 
fied for individual tissues was similar to the 
single-tissue eQTL analysis (Fig. 2A). Next, 
the minimum P value for each gene across 
all nine tissues was used to test for eQTL evi- 
dence, and subjected to gene-level false discov- 
ery control. A total of 7425 eGenes with FDR < 
0.05 were identified, representing a factor of 
3 increase relative to the maximum number 
of significant eGenes for any single tissue. The 
Bayesian models, which leverage the high pro- 
portion of tissue-common eQTLs, increase the 
power to detect eQTLs for an individual tissue 
by borrowing strength from the remaining tis- 
sues. Thus, of the original 22,286 expressed genes, 
10,030 showed a significant eQTL (FDR < 0.05) 
with the gene-based Bayesian multitissue model 


Fig. 2. Number and sharing of significant cis- 
eQTLs per tissue. (A) Numbers of significant 
cis-eQTL genes (eGenes) per tissue according to 
single-tissue analysis. For each gene, the minimum 
nominal P value was used as the test statistic and 
an empirical P value was computed to correct for 
number of tests per gene, based on either permu- 
tation analysis of genotype sample labels applied 
to the full set of samples per tissue (#) or Bonferroni 
correction, used for downsampling (line) to reduce 
computational burden (14). In the range of sample 
sizes tested, the number of identified eGenes in- 
creases linearly with sample size. (B) Dendrogram 
and heat map of pairwise eQTL sharing using the 
method of Nica et al. (22). Values are not symmet- 
rical, since each entry in row i and column j is an 
estimate of ; = Pr(eQTL in tissue i given an eQTL in 
tissue /). Blood has the lowest levels of eQTL shar- 
ing with other tissues while adipose shows higher 
levels of sharing. (C) Activity probabilities for both 
multitissue modeling approaches, applied to all nine 
tissues, indicate that the most likely configurations 
are for eQTLs that are active in only a few tissues or 
in many tissues. (D) For eQTLs in each tissue con- 
sidered separately, analyzing multiple tissues jointly 
increases the number of discovered eQTL associa- 
tions (FDR < 0.05), as assessed by the SNP-based 
multitissue model. 
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(approximately 35% more than by joint per- 
mutation). Using the SNP-based Bayesian mod- 
el, we selected four of the tissues with the 
largest sample sizes, and computed the mar- 
ginal eQTL probability for each tissue alone 
after adding one to eight additional tissues in 
a fixed sequence. This resulted in a marked in- 
crease in the number of significant SNP-gene 
pairs (FDR < 0.05) (Fig. 2D). For downstream 
analyses using the multitissue results, a single 
SNP with highest posterior probability was chosen 
as representative across all tissues, recogniz- 
ing that multiple causal SNPs would be difficult 
to resolve with the current sample size. Pos- 
terior probability profiles for both Bayesian 
models, as well as marginal posterior proba- 
bilities that an eQTL is active in each tissue, 
based on summing the probabilities across all 
configurations, are available for these genes 
(http://gtexportal.org). 


Allele-specific expression analysis 


To complement eQTL analysis, we used allele- 
specific expression (ASE) of genes to indirectly 
estimate the overall effect of cis-regulatory var- 
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iants on the expression of nearby genes. Indi- 
viduals that are heterozygous for a cis-regulatory 
variant may differentially express each of the 
two alleles of the affected gene. In these cases, 
assuming no other causes for allele-specific 
expression, imbalanced allelic expression of a 
gene can be used to infer the effect of cis reg- 
ulation on the gene even without necessarily 
identifying the specific regulatory variant(s). 
In addition to the subtle cis-regulatory effects 
described here, ASE of premature stop variants is 
also informative of nonsense-mediated decay (30), 
and consistent monoallelic expression of spe- 
cific genes can be used to detect epigenetic si- 
lencing by imprinting. 

ASE can be estimated from the abundance 
of RNA-seq reads supporting each of the gene’s 
alleles, measured at heterozygous sites within 
the transcribed region. We identified a me- 
dian of 6385 sites per sample (across all tissues 
and all individuals) that were both heterozy- 
gous for a transcript SNP in the imputed geno- 
type data of the associated individual and 
well expressed in that sample (SNP covered 
by = 30 RNA-seq reads) (fig. S18). Of these 


sites, a median of 390 sites per sample had 
significant ASE (binomial P < 0.005, or 144 
sites with FDR 21%, after rigorous quality con- 
trol of ASE sites and after downsampling to 
exactly 30 reads at each site to ensure equal 
power to detect ASE) (Fig. 3A and table S8A) 
(14). To assess whether different tissues vary 
in terms of the overall effect of cis-regulatory 
variation in them, we estimated the propor- 
tion of those sites with significant ASE in each 
of the tissues (combining data from all indi- 
viduals) (14) and found that the fraction of 
significant ASE sites varied widely across tis- 
sues, with a range of 1.7 to 3.7% (median 2.3%). 
Brain regions appeared depleted for allelic ef- 
fects (2.0% ASE in brain versus 2.7% in other 
tissues combined; P < 2 x 107°, Fisher’s exact 
test) (Fig. 3A), consistent with the hypothesis 
that brain-specific genes exhibit reduced ge- 
netic diversity (32). Our results generally sup- 
port our eQTL findings; in thyroid, where we 
observed a high number of eQTLs (Fig. 2A), we 
also see an increased amount of ASE (Fig. 3B). 
Blood and LCLs have a higher proportion of al- 
lelic effects (3.6% versus 2.3% in others; P < 2 x 
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Fig. 3. Quantification of regulatory diversity by ASE. (A) Proportion of 
sites with significant ASE (P < 0.005) in each tissue (colored and labeled 
as in Table 1), with binomial confidence intervals. (B) Proportion of sig- 
nificant ASE sites for the nine tissues with eQTL data as a function of the 
proportion of eQTLs after regressing out the log of sample size. (C) Par- 
titioning variation in allelic and total gene expression within and between 
individuals and tissues. We calculated pairwise Spearman rank correla- 
tions between all the samples using two metrics [(D) and (E)]. (D) Allelic 
ratios over sites (Sampled to 30 reads each), which captures similarity in 
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allelic effects that are a proxy for cis-regulatory variation. (E) Total read 
counts over the same sites, which captures similarity in total gene 
expression levels. The plots show the distributions of pairwise correla- 
tions for sample pairs that are from (1) different tissues and different 
individual, (2) different tissues within an individual, or (3) same tissues in 
different individuals. Gene expression levels are highly correlated within 
the same tissue (E3) (see Fig. 1A). However, allelic ratios show highest 
correlation among different tissues of the same individual that share the 
same genome (D2). 
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10", Fisher’s exact test) (Fig. 3B), suggesting 
nongenetic sources of monoallelic expression 
possibly due to the more clonal nature of these 
cells (33). 

To compare between-sample and between-tissue 
sharing of ASE with the overall similarity of gene 
expression, we constructed two Spearman rank 
correlation matrices between all pairs of samples. 
In one, we correlated ASE ratios, capturing shar- 
ing of cis regulation; in the other, we correlated 
total coverage of the same sites, capturing sim- 
ilarity of overall gene expression levels, analyzing 
genes that are expressed in both tissues (both ma- 
trices used counts of reads covering heterozygous 
sites shared between the compared pair of sam- 
ples) (74). The two matrices of tissue medians were 
highly correlated (Mantel test, 7 = 0.772, P < 0.0001; 
fig. S19), indicating that tissues with similar gene 
expression profiles also have a higher degree of 
sharing of genetic regulatory effects. 

Interestingly, when partitioning the full sample- 
level matrix into correlations between samples 
from different tissues of the same individual, the 
same tissue across individuals, or different indi- 
viduals and different tissues (Fig. 3C), we found 
more complex relationships between total expres- 
sion and allelic ratios. As expected, expression 
levels are determined by tissue, and samples clus- 
ter by tissue (75.6% of variance; Fig. 3E and table 
S8B). However, allelic ratios show the opposite 
pattern (Fig. 3D), with higher correlation among 
tissues within the same individual (17.9% of 


variance) than among individuals for the same 
tissue (8.6%). These results indicate that ASE is 
primarily determined by the common genome 
among different tissues of the same individual 
(34). This suggests that the two dimensions of 
gene expression variability, gene expression levels 
and allelic ratios, are largely defined by indepen- 
dent factors. 

ASE analysis can also be used as orthogonal 
confirmation of eQTL effects, on the basis of the 
expectation that individuals who are heterozygous 
for a cis-eQTL variant should manifest biased al- 
lelic ratios in the eQTL target gene. We performed 
this analysis at the genome-wide level across all 
the tissues in the GTEx data set, including the 
tissues with sample sizes that were too small for 
eQTL analysis. Examining the significant eQTLs 
identified by single-tissue analysis for the nine 
main tissues, we looked at their ASE effects sep- 
arately in each of the 42 tissues, calculating the 
odds ratio of significant versus nonsignificant 
ASE for eQTL heterozygote versus homozygote 
individuals (e.g., for thyroid eQTLs we looked for 
ASE in all 42 tissues, then for blood eQTLs and 
so on) (figs. S20 and S21). In addition to repli- 
cation of eQTL signals in the discovery tissue, we 
can estimate how relevant our eQTL findings 
from nine tissues are to a wide variety of other 
tissues (independent of sample size or allele fre- 
quency) and then assess which tissue is the best 
proxy for capturing regulatory effects in another 
tissues of interest. 


We found that the overall tissue specificity of 
the eQTL sets varies. For example, eQTLs identi- 
fied in skeletal muscle were less active in other 
tissues. Some tissues, such as brain, were not well 
captured by any of the nine eQTL sets here. ASE 
analysis also allows quantification of genome-wide 
tissue sharing of cis-regulatory signals without rely- 
ing on eQTL discovery, but instead by measur- 
ing how often a site with a significant ASE signal 
in one tissue is significant in another tissue of the 
same individual. The proportion of shared ASE 
effects between tissue pairs within an individual 
varies between 36% and 58% (mean 46%), with 
slightly increased sharing observed between close- 
ly related tissues (conditioning on each site being 
measurable in both of the tissues) (fig. S22A). This 
relatively high degree of sharing is consistent with 
the eQTL results described above. If we relax the 
constraint of requiring a gene to be expressed in 
both tissues, then the proportion of shared ASE 
effects is substantially lower (0.85 to 39%, mean 
11%; fig. S22B). This finding represents the total 
probability of detecting a regulatory effect in an- 
other tissue, and highlights a high degree of ap- 
parent tissue specificity that derives from the fact 
that a gene expressed in one tissue is often sim- 
ply not expressed in another. This is particularly 
pronounced in brain, where the large proportion 
of genes showing tissue-specific isoform expres- 
sion in that organ (17) drives a lower degree of 
overall sharing. Whole blood and skeletal mus- 
cle are partial outliers, relative to other tissues, 
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Fig. 4. Cis-regulatory effects in individuals that are not explained by 
detected eQTLs. (A) An eQTL showing individual homozygous (AA) for the 
eQTL SNP (left panel) or heterozygous (AG) (right panel). ASE is measured at 
the TC SNP. (B) An example of replication of an eQTL signal in ASE analysis in 
the NDRG4 gene, with eQTL heterozygotes showing higher ASE in the eQTL 
target gene than eQTL homozygotes (only a subset of individuals shown; linear 
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regression P = 5.69 x 10°). The error bars are from a binomial test for the 
allelic ratio. (C) For each eQTL gene where the eQTL signal was replicated in 
ASE (linear regression P < 0.05 after Bonferroni correction), the eQTL het- 
erozygotes show higher variance in allelic ratio (Mann-Whitney P = 2.13 x 107”). 
(D) Permuted P value for the variance between individuals, which is higher than 
expected in 22/53 genes (9 genes in homozygotes, 20 in heterozygotes). 
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with lower sharing of eQTLs and lower average 
replication in other tissues (Fig. 2A, fig. S12, 
and figs. S20 to S22). 

Although eQTL analysis and other association- 
based analyses are efficient for identifying common 
variants with phenotypic effects in populations, 
the genotype of those variants may not be a good 
indicator of the gene expression phenotype at an 
individual level. We investigated this using ASE 
analysis to estimate how consistent the allelic ef- 
fect of single-tissue eQTL variants are across indi- 
viduals that share the same genotype for the best 
associated eQTL variant per gene. We tested a 
subset of 606 eQTLs that had high read coverage 
and a large number of samples for transcript het- 
erozygous sites in the eQTL discovery tissue. We 
identified 53 eQTLs with significant replication 
of the eQTL signal in ASE, as inferred by higher 
allelic imbalance in eQTL heterozygotes than 
in homozygotes (linear regression P < 0.05 after 
Bonferroni correction; Fig. 4, A and B). Further, 
for 22 of the 53 eQTL genes, individuals show 
variability in their allelic ratios that cannot be 
accounted for by the eQTL genotype or sampling 
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error (Bonferroni corrected P < 0.05 from per- 
mutation of read counts; Fig. 4, C and D). These 
results highlight that common eQTLs do not ex- 
plain all of the cis-regulatory effects in individ- 
uals and are inadequate predictors of allelic 
expression variance at the individual level. The 
higher variance in eQTL heterozygotes than in 
homozygotes (Mann-Whitney P = 2.13 x 1077; 
Fig. 4B) suggests that part of this variance 
might originate from modification of the main 
eQTL effect by environmental or trans effects, 
or due to additional, independent regulatory 
variants (34). 


Analysis of splicing QTLs 


Beyond estimating overall levels of gene expres- 
sion, RNA-seq data also allow for the quantifica- 
tion of expression levels of individual transcript 
isoforms, as well as components of these, such as 
exons, splice junctions, and untranslated regions. 
We refer to the quantitative variation of gene struc- 
ture due to genetic variation as splicing QTLs 
(sQTLs) (Fig. 5A). To identify sQTLs, we used 
Altrans (35), a method that identifies SNPs that 


are associated with variation in the expression 
levels of exon junctions (sjQTLs; box S1) (74), and 
sQTLSeekeR (36), a method that identifies SNPs 
associated with the variation in the relative abun- 
dances of gene transcript isoforms (srQTLs; box 
S1) (7/4). Altrans identifies both novel and anno- 
tated splicing events, while sQTLSeekeR tests 
only annotated isoforms. Altrans, however, is 
restricted to changes in the usage of splice junc- 
tions, while sQTLSeekeR can in principle detect 
any variation in the relative abundance of tran- 
script isoforms (fig. S23). Altrans was run using 
a +1 Mb region around the TSS, while for 
sQTLseekeR, we tested within the body of the 
gene +5 kb (4). 

We detected an average of ~1900 genes with 
Altrans, and ~250 with sQTLseekeR, with at least 
one cis-sQTL per tissue (FDR = 0.05; table S9). 
The greater genomic window tested around the 
TSS for Altrans, and the capacity to detect novel 
splicing events, explains the much larger number 
of sQTLs detected by Altrans than by sQTLseekeR 
(about 70% of Altrans sQTLs correspond to novel 
events, and only about 10% map within 5 kb of 
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annotated genes). Despite the difference, Altrans 
and sQTLseekeR show strong complementarity, 
detecting variants that are associated with very 
different types of alternative splicing events. Most 
of the splice events detected by Altrans (80%) are 
exon-skipping events, while 60% of those de- 
tected by sQTLseekeR correspond to complex 
splice events, such as mutually exclusive exons 
(Fig. 5B and fig. $23). If we consider only sQTLs 
associated with exon-skipping events, the over- 
lap in sQTLs identified by both methods is sub- 
stantial (86% of sQTLseekeR are also found by 
Altrans, P = 0.004) (figs. S23 to S25). An example 
of an sQTL that we identified with potential bi- 
ological relevance is shown in Fig. 5A (further 
examples are in fig. S26). 

Significant sQTLs show a high degree of 
sharing among tissue pairs, with tissue-specific 
sQTLs accounting only for 7 to 21% of the total 
depending on the tissue (figs. S27 and S28). 
The highest degree of sharing is between heart 
left ventricle and whole blood, whereas the two 
tissues that share the fewest sQTLs are whole 
blood and Sun-exposed skin. In general, sQTLs 
identified in whole blood are shared at lower 
levels with other tissues, as was observed for 
eQTLs (Fig. 2A and fig. S12, A and B). Although 
we observe the same enrichment of sQTLs around 
the TSS as seen for eQTLs, the sQTLs that are 
shared across multiple tissues tend to be closer 
to the TSS than those that are tissue-specific 
(fig. S28). On average, 20% of sQTLs associated 
with changes in exon junction abundance by 
Altrans were also predicted to be eQTLs (x, = 
0.20, 1, = 0.14 to 0.27; table S10). An even larger 
fraction (48%) of sQTLs detected by sQTLseekeR 
associated with changes in relative abundances 
of gene transcript isoforms, were identified as 
eQTLs (1; = 0.48, 1 = 0.13 to 0.70; table S10). This 
enrichment of eQTLs among sQTIs is larger 
than expected at FDR < 0.05, but a substantial 
fraction of SQTLs are unique and not detected by 
standard eQTL analysis. 


Functional annotation of eQTLs in 
noncoding regions 


Genetic variants affecting gene expression and 
splicing patterns have been shown to fall within 
regulatory elements, providing a potential mo- 
lecular basis for their effects (37-39). To assess 
whether the eQTLs discovered across the nine 
tissues were enriched in regulatory regions, we 
used a set of regulatory annotations from the 
ENCODE project (0) and the Epigenomics 
Roadmap project (40), including regulator-bound 
locations, DNase I hypersensitivity sites, and 
maps of histone modifications for proximal and 
distal regulatory regions (J4). For each tissue, we 
chose the top significant SNP per gene from the 
single-tissue eQTL analysis (14,431 eQTL SNPs). 
Discarding SNPs that were within annotated 
genes resulted in 4085 intergenic eQTL SNPs, 
which were compared to our regulatory anno- 
tations (14). Intergenic eQTL SNPs were en- 
riched for transcription factor-bound sites, open 
chromatin, promoters, and enhancers (P = 4.3 x 
10°18, 2.9 x 10-8, 17 x 10°", and 0.003, respec- 
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tively) relative to the density of these features 
within a 2.5-kb window of the tested eQTLs (fig. 
$29 and table S11). This enrichment was even 
more pronounced in a subset of 91 unambiguous 
intergenic eQTL SNPs (fig. S29). In contrast, we 
found sQTLs to be enriched within annotated 
splice junctions relative to other functional re- 
gions (Fig. 5C). 

We also asked whether the specific SNP-gene 
links (“genetic links”) predicted by our eQTL analy- 
sis were supported by enhancer-gene links based 
on functional correlation (“functional links”) (14). 
We found that open chromatin regions contain- 
ing an eQTL SNP show higher correlation to the 
corresponding gene, relative to open chromatin 
regions without an eQTL at similar distances and 
linked to genes also containing an independent 
eQTL (r = 0.037 versus 0.030, paired Mann-Whitney 
two-sided P = 4.3 x 10", SE = 0.00114, n = 32,168 
eQTL-SNP pairs). 

This demonstrates the value of eQTL infor- 
mation to establish functional links between 
regulatory elements and genes in the genome, 
an analysis that will increase in power as ad- 
ditional tissues and subjects are added to the 
data set. 


Gene network inference within tissues 
using cross-individual expression variation 


To provide a view of coordinated gene regulation 
arising from both cis and trans genetic effects 
and nongenetic sources, we inferred gene-gene 
coexpression networks within tissues. Studying 
each of the nine tissues in isolation and examin- 
ing variation of gene expression across individ- 
uals, we linked pairs of genes that show correlated 
expression (top 1% of pairs by Pearson correla- 
tion) (74), revealing similar patterns of coexpres- 
sion across tissue pairs (Fig. 6A). The median 
Tt Statistic (fraction of true positive results) (26), 
estimating the total fraction of coexpressed gene 
pairs identified in one tissue that are replicated 
in a second tissue, is 0.44, ranging from 0.30 to 
0.58 (Fig. 6A). Furthermore, the specific tissue 
pairs with higher overlap of coexpressed genes 
also have a greater overlap of cis-eQTLs than 
other tissue pairs, indicating a similar pattern of 
tissue relatedness underlying both results (P < 0.05 
for correlation between similarity matrices of Figs. 
2B and 6A). Thus, although coexpression networks 
primarily capture non-cis mechanisms, including 
trans regulation and environmental factors, the 
overall level of sharing and the specific patterns 
of tissue relatedness agree with the patterns ob- 
served from cis-eQTLs, which suggests that regu- 
latory mechanisms beyond cis effects may be shared 
across tissues. 

We also used a weighted gene coexpression 
network analysis (WGCNA) approach (41) to con- 
struct coexpression networks and extract gene 
modules for each tissue (Fig. 6B). The clustering 
of coexpressed genes into modules allows us to 
identify active biological processes across tissues 
(Fig. 6C). Modules enriched for common Gene 
Ontology (GO) biological processes were ob- 
served in all nine tissues (e.g., cell cycle, protein 
transport), while other biological processes were 


only seen in specific tissues (e.g., fatty-acyl-CoA 
metabolic process enrichment seen only in adi- 
pose, Benjamini-Hochberg corrected P = 2.6 x 
107”) (14). In addition to functional annotation, 
the identified coexpression networks also en- 
abled us to search for potential transcriptional 
regulators of these modules (Fig. 6D). We found 
enrichment of transcription factor binding in 
promoter regions of genes in the same module 
using ENCODE chromatin immunoprecipita- 
tion (ChIP)-seq data (42), suggesting that some 
modules could be regulated by a large number 
of transcription factors (fig. S30A). Finally, we 
compared coexpression modules learned in 
each tissue individually to those learned in other 
tissues, based on common gene membership 
and correlation of first principal components 
of expression variation across individuals 
(fig. S30B). Many modules showed correla- 
tion between principal components to other 
modules across tissues but lacked common 
genes, demonstrating that similarity in patterns 
of variation is sometimes only visible at the 
module level. 


Gene network inference within individuals 
using cross-tissue expression variation 


The availability of multiple tissues per individual 
further allowed us to define modules of co-regulated 
genes within each individual, by correlating gene 
expression across tissues for the same individ- 
ual (14). Merging modules across individuals 
produced 117 modules (fig. S6E), containing be- 
tween 25 and 414 genes. Each of these modules 
corresponds to a multitissue expression pattern 
for a gene, enabling us to study changes in the 
regulatory program of genes that affect multi- 
ple tissues at a time. 

We used these modules to identify instances of 
coordinated variation in multitissue expression 
patterns across individuals. For each gene, we es- 
timated the proximity of each individual’s expres- 
sion pattern to the median expression pattern of 
each module, corresponding to a “module mem- 
bership score” (Fig. 6F and fig. S31A). We then 
calculated a module membership score for each 
individual (Fig. 6G) (14). Cases where depletion 
in membership of one module was accompanied 
by a corresponding increase in membership of a 
different module are called “module switching 
events.” The majority of genes showed conserved 
multitissue gene expression patterns among in- 
dividuals, remaining in the same module or switch- 
ing between modules with similar expression 
patterns (correlation distance < 0.5). However, we 
identified 3965 genes (21%) that show switching 
between dissimilar modules (correlation distance 
> 0.5), which may have important biological conse- 
quences (fig. S31B). Using module membership 
scores as a quantitative trait, we searched for 
neighboring SNPs that are correlated with these 
module membership scores (Fig. 6H), which we 
call module-switching QTLs (modQTLs). Search- 
ing a window of IMbp around each gene in cis, 
we found a total of 2102 modQTLs associated 
with statistically significant switching (FDR < 0.05) 
between dissimilar gene modules (correlation 
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distance > 0.5), suggesting that genetic varia- 
tion for those genes leads to changes in multi- 
tissue regulatory programs. For example, the 
ZFP57 gene shows three distinct patterns of mul- 
titissue expression across individuals, and three 


differences (Fig. 6H). 
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together account for only 42% of modQTL SNPs. 
Hence, 58% of modQTLs are not discovered by 
other methods, which suggests that the ap- 
proach has the potential to reveal novel genetic 
effects on the modulation of genes across tissues 


(fig. S31C). 


Personal transcriptomics and 
implications for human disease 


The in-depth analysis of multitissue transcrip- 
tomes enables both an understanding of the 
population-level properties of the transcriptome 
as well as individual level properties inferred from 
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As is the case for personalized genomics, this 
individual-level transcriptome analysis is likely 
to become a crucial addition to the personalized 
assessment of an individual’s biology and likely 
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challenge. Variants predicted to result in the 
truncation of proteins (splice, stop-gain SNVs, 
or frame-shift indels) may have large effects on 
biochemical function, but are also highly enriched 
for annotation artifacts (43). Errors in predict- 
ing the true functional impact of these and oth- 
er variants can substantially reduce discovery 
power in common complex diseases and, more 
important, can affect disease diagnosis in clinical 
settings (44). 

The GTEx multitissue expression data pro- 
vide an opportunity to assess the real impact of 
protein-truncating variants (PTVs) on the human 


Fig. 7. Integration of transcriptome data im- 
proves annotation of putative protein truncating 
variants (PTVs). (A) The majority of annotated 
PTV variants are partial PTV, meaning that only a 
fraction of the RNA-seq transcripts support PTV 
annotation. (B) For all the predicted PTV variants, 
we ask what percentage of variants maintain a PTV 
annotation if we require that a fixed percentage of 
the dominant isoforms across all sequenced tis- 
sues support a PTV prediction; 70% of PTV var- 
iants are relevant if the threshold is 10%, whereas 
only 40% of PTV variants are relevant if the thresh- 
old is 100%. 


Fig. 8. Tissue-dependent GWAS eQTL enrichment 
Q-Q plots. (A) eQTLs are enriched for trait associ- 
ations with an important class of complex diseases. 
eQTLs discovered in whole blood (plotted in red) 
show significant enrichment for SNPs associated 
with autoimmune disorders from the WTCCC study 
(type 1 diabetes, Crohn’s disease, and rheumatoid 
arthritis) relative to null expectation (shown in gray) 
defined by non-eQTLs. (B) Enrichment of eQTLs for 
disease associations is tissue-dependent. Single- 
tissue eQTL annotation can be used to increase 
power to detect associations with hypertension, a 
disease for which the WTCCC study failed to yield 
significant associations. Notably, eQTLs discovered 
in adipose are enriched relative to muscle, lung, 
thyroid, skin, heart, and tibial artery (P < 0.05, 
Kolmogorov-Smirnov test) for known SNP associa- 
tions with the hypertension. 
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transcriptome [see also (32)]. We combined 
exome sequencing and RNA-seq data from 173 
GTEx individuals to assess the global proper- 
ties of predicted high-confidence PTVs (14). 
PTVs are enriched in alternatively spliced exons, 
with just 38.4% of high-confidence PTVs having 
annotation support across all reported tran- 
scripts (Fig. 7A), and only 51 to 55% supported 
by the major transcript of at least one tissue (for 
all tissues with at least 10 samples). These num- 
bers highlight the need for careful transcript- 
specific assessment of functional annotation 
for all classes of variation. Furthermore, if we 
require that a fixed percentage of the dominant 
isoforms across all sequenced tissues support 
this annotation, we find that the percentage of 
predicted PTVs with annotation support of 
PTV decreases as we increase the threshold for 
the proportion of tissues with major transcript 
isoform support for PTV prediction (Fig. 7B). 
This highlights the need for empirically derived 
reference transcript sets across a broad array of 
tissues to enhance clinical interpretation for per- 
sonal genome sequencing and disease studies. 


An example with clinical ramifications is shown 
in fig. $32. 


GWAS and eQTLs 


The ultimate goal of the GTEx project is to pro- 
vide a framework for biological interpretation of 
disease-related variants. To evaluate the relevance 
of the discovered eQTLs in disease mapping 
studies, we tested the eQTLs identified in each 
tissue for association with disease using the Wellcome 
Trust Case Control Consortium (WTCCC) studies 
of seven complex disorders (45) (see Fig. 8A). 
Using eQTLs identified in whole blood, we found 
an enrichment for top associations with autoim- 
mune diseases (shown as a leftward shift from 
the null distribution in the Q-Q plot in Fig. 8A), 
for Crohn’s disease, rheumatoid arthritis, and 
type 1 diabetes among eQTLs (P < 2.2 x 10° for 
each disease), consistent with the utility of blood 
and lymphoblastoid cell lines (LCLs) in trait map- 
ping for autoimmune disorders (46, 47). In con- 
trast to the autoimmune disorders, we found no 
enrichment for association (no shift in P value 
distribution) with bipolar disorder or type 2 


diabetes among the blood eQTLs (P > 0.05) 
(Fig. 8A). This tissue specificity of autoimmune 
enrichment results suggests they are not due to 
the confounding effects that result from similar 
underlying genomic properties between eQTL and 
disease association regions. More generally, we 
observed trait-specific levels of enrichment for 
the WTCCC disease traits among the nine differ- 
ent single-tissue eQTL sets. 

Remarkably, the use of eQTLs increased pow- 
er to detect associations with hypertension 
(Fig. 8B). In particular, eQTLs in subcutaneous 
adipose were significantly enriched for multiple 
associations with hypertension relative to muscle, 
lung, thyroid, skin, heart, and tibial artery (P < 
0.05, Kolmogorov-Smirnov test) (Fig. 8B). This 
is particularly noteworthy because the WT'CCC 
GWAS of this disease did not yield any genome- 
wide significant associations, which suggests 
that larger sample sizes were required to identify 
highly significant SNP associations in the ab- 
sence of functional data. Because the majority 
of GWAS-identified variants (~95%) lie in non- 
coding regions of the genome, we determined 
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Fig. 9. A blood pressure-associated SNP is a significant eQTL in tibial 
artery, for ARHGAP42 and TMEM133. (A) The GWAS SNP, rs633185 in the 
intron of ARHGAP42, is associated with systolic blood pressure (P = 1.2 x 10°”) 
and diastolic blood pressure (P = 2 x 107). This GWAS SNP is in tight LD (r = 
0.93) with the most significant eQTL for ARHGAP42 in tibial artery, rs604723 
(P =1x 107%), and is the most significant eQTL for TMEMI133 in tibial artery 
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(P = 2.7 x 10°). Tibial artery was the only significant tissue at FDR < 0.05 
according to the single-tissue eQTL discovery method. (B) Average posterior 
probability of the most significant cis-eQTL, rs607562 for ARHGAP42 at FDR < 
0.05 from the multitissue eQTL methods. (C) Similar plot for TMEM133. The 
most significant cis-eQTL for TMEM133 from the multitissue methods at FDR < 
0.05 is the GWAS SNP, rs633185, in tibial artery. 
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which genome-wide significant trait associations 
(P < 5 x 10°®) reported to date are in LD with 
at least one GTEx-identified eQTL. We merged 
NCBI’s Phenotype-Genotype Integrator (PheGenI) 
(48) and the NHGRI GWAS catalog (49), yielding 
10,129 genome-wide significant SNP associations 
with nearly 630 distinct complex traits. In to- 
tal, 5195 “independent” SNPs were identified 
after LD pruning at 7° = 0.8 and counting SNPs 
only once (J4). Of these, 308 (~6%) are in strong 
LD (7° = 0.8), with a “best eQTL per gene” (at FDR < 
0.05) from either the single-tissue or multitis- 
sue eQTL discovery analysis (table S12) in at 
least one of the nine tissues tested. For two- 
thirds of these cases (211 SNPs), no putative del- 
eterious coding variants (nonsynonymous or splice 
variants) in the target gene product lie in LD 
(7 = 0.8) with the GWAS SNP; this finding sug- 
gests that regulatory effects may underlie the 
causal mechanism, although additional work is 
needed to prove causality. GWAS SNPs in LD 
with an eQTL show a factor of 2 higher repre- 
sentation in coding regions relative to all GWAS 
SNPs (11% versus 4.6%; table S13). Notably, about 
one-third of the eQTLs in LD with GWAS SNPs 
were detected only with methods that leverage 
the multitissue nature of GTEx data. Increasing both 
sample sizes and the range of tissues will likely in- 
crease the number of detected GWAS-eQTL loci. 

Annotating a GWAS SNP with an eQTL can 
help to highlight candidate causal genes within a 
locus (i.e., the eQTL target gene). We found that 
proximity-based and eQTL-based gene assignments 
for GWAS SNPs were often discordant (47). A 
surprising proportion of trait-associated SNPs 
in LD (7° = 0.80) with a GTEx eQTL showed dis- 
agreement between the strongest eQTL-derived 
target gene and the genes that were physically 
proximal to the GWAS SNP (table S14). Of 190 
GWAS loci (P < 5 x 107°) where the lead SNP is 
an eQTL from the single-tissue analysis (FDR < 
0.05) with only a single target gene, in 65 cases 
(~34%) this eQTL target gene differs from any of 
the genes that were closest to the SNPs in LD. 
These results were also observed when we re- 
stricted the target genes to protein-coding genes, 
when we pruned the GWAS SNPs for each trait 
examined (7? = 0.80), and when we used the 
eQTLs identified from the multitissue joint eQTL 
analysis. 

In addition to prioritizing causal genes in 
GWAS loci, an eQTL catalog from multiple hu- 
man tissues can highlight the relevant tissue(s) 
of action, evaluate the tissue specificity of GWAS 
loci, and characterize pleiotropic associations. We 
demonstrate the value of multitissue data to ex- 
plore and resolve these issues for the GWAS intronic 
SNP, 1s633185, located in ARHGAP42 (Fig. 9). This 
GWAS SNP is in high LD (7° = 0.93) with the best 
cis-eQTL (18604723) targeting ARHGAP42, and the 
best cis-eQTL for a neighboring gene, TMEM133 
in tibial artery. Evaluating eQTL significance in 
all nine tissues shows that although the eQTL’s 
significance is indeed strongest in tibial artery, sev- 
eral other tissues may merit consideration in ex- 
ploring the causal function of this locus, such as 
subcutaneous adipose and skin (Fig. 9, B and C, 
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and fig. S33). This supports the need to explore 
the genetic basis of disease in the fuller context of 
a wide range of human tissues. The GTEx eQTLs 
may also be useful in highlighting the role of non- 
coding genes in disease risk and other complex 
traits (fig. S34). We found that ~13% of candidate 
genes proposed by GTEx eQTLs, and in LD to 
genome-wide significant GWAS SNPs, are non- 
coding genes (table S15). 


Conclusions 


We have described a large in-depth data set of 
multitissue human gene expression. We assessed 
the variability of the transcriptome among indi- 
viduals in a large number of tissues at a resolution 
that provides unique insights in to the diversity 
and regulation of gene expression among tissues. 
This analysis provides a unified view of genetic 
effects on gene expression across a broad range 
of tissue types, most of which have not been 
studied for eQTLs previously. We look forward 
to scaling up the resource to create a data set 
that will transform our understanding of how 
genetic variability influences different tissues 
and biological systems and ultimately complex 
diseases. 
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HUMAN GENOMICS 


The human transcriptome across 
tissues and individuals 
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Transcriptional regulation and posttranscriptional processing underlie many cellular and 
organismal phenotypes. We used RNA sequence data generated by Genotype-Tissue 
Expression (GTEx) project to investigate the patterns of transcriptome variation across 
individuals and tissues. Tissues exhibit characteristic transcriptional signatures that show 
stability in postmortem samples. These signatures are dominated by a relatively small 
number of genes—which is most clearly seen in blood—though few are exclusive to 

a particular tissue and vary more across tissues than individuals. Genes exhibiting high 
interindividual expression variation include disease candidates associated with sex, 
ethnicity, and age. Primary transcription is the major driver of cellular specificity, with 
splicing playing mostly a complementary role; except for the brain, which exhibits a more 
divergent splicing program. Variation in splicing, despite its stochasticity, may play in 
contrast a comparatively greater role in defining individual phenotypes. 


ene expression is the key determinant of 
cellular phenotype, and genome-wide ex- 
pression analysis has been a mainstay of 
genomics and biomedical research, pro- 
viding insights into the molecular events 


underlying human biology and disease. Whereas 
expression data sets from tissues/primary cells 
C, 2) and individuals (3) have accumulated over 
recent years, only limited expression data sets have 
allowed analysis across tissues and individuals 


sciencemag.org SCIENCE 


RESEARCH | REPORTS 


simultaneously (4). The Genotype-Tissue Ex- 
pression Project (GTEx) is developing such a 
resource (5, 6), collecting multiple “nondiseased” 
tissues sampled from recently deceased human 
donors. We analyzed the GTEx pilot data freeze 
(6), which comprised RNA sequencing (RNA- 
seq) from 1641 samples from 175 individuals 
representing 43 sites: 29 solid organ tissues, 
11 brain subregions, whole blood, and two cell 
lines: Epstein-Barr virus-transformed lympho- 
cytes (LCL) and cultured fibroblasts from skin 
[table S1 and (7)]. 

The identification and characterization of 
genetic variants that are associated with gene 
expression are extensively discussed in (6). Here 
we use the GTEx data to investigate the pat- 
terns of transcriptome variation across indi- 
viduals and tissues and how these patterns 
associate with human phenotypes. RNA-seq per- 
formed on the GTEx pilot samples produced 
an average of 80 million paired-end mapped 
reads per sample (fig. S1) (7, 8). We used the 
mapped reads to quantify gene expression using 
Gencode V12 annotation (9), which includes 
20,110 protein-coding genes (PCGs) and 11,790 
long noncoding RNAs (IncRNAs). Comparison 
with microarray-based quantification for a sub- 
set of 736 samples showed concordance be- 
tween the two technologies (average correlation 
coefficient = 0.83, fig. S2). At the threshold de- 
fined for expression quantitative trait loci (eQTL) 
analysis [reads per kilobase per million mapped 
reads (RPKM) > 0.1, see (7)], at which 88% of 
PCGs and 71% of IncRNAs are detected in at least 
one sample, the distribution of gene expression 
across tissues is U-shaped and complementary 
between PCGs (generally ubiquitously expressed) 
and IncRNAs (typically tissue-specific or not ex- 
pressed, Fig. 1A). 

Tissues show a characteristic transcription- 
al signature, as revealed by multidimensional 
scaling, of both PCG and IncRNA expression 
(figs. 1B, S3, and S4), with individual pheno- 
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types contributing little (fig. S5). The primary 
separation, as observed in prior studies (10), 
is between nonsolid (blood) and solid tissues 
and, within solid tissues, brain is the most dis- 
tinct. Brain subregions are not well differen- 
tiated, with the exception of cerebellum (fig. S6). 
Postmortem ischemia appears to have little im- 
pact on the characteristic tissue transcriptional 
signatures, as previously noted (77). In a com- 
parison of 798 GTEx samples with 609 “nondis- 
eased” samples obtained from living (surgical) 
donors (table S2), we found that GTEx samples 
clustered with surgical samples of the same tis- 
sue type (Fig. 1C and table S3) (72). 

Tissue transcription is generally dominated 
by the expression of a relatively small number 
of genes. Indeed, we found that for most tis- 
sues, about 50% of the transcription is accounted 
for by a few hundred genes (13). In many tissues, 
the bulk of transcription is of mitochondrial 
origin (Fig. 1D and table S4) (14). In kidney, 
for instance, a highly aerobic tissue with many 
mitochondria, a median of 51% (>65% in some 
samples) of the transcriptional output is from 
the mitochondria (fig. S7). Other tissues show 
nuclear-dominated expression; in blood, for ex- 
ample, three hemoglobin genes contribute more 
than 60% to total transcription. Genes related 
to lipid metabolism in pancreas, actin in muscle, 
and thyroglobulin in thyroid are other exam- 
ples of nuclear genes contributing dispropor- 
tionally to tissue-specific transcription. Because 
RNA samples are generally sequenced to the same 
depth, in tissues where a few genes dominate 
expression, fewer RNA-seq reads are compar- 
atively available to estimate the expression of 
the remaining genes, decreasing the power to 
estimate expression variation. These tissues— 
i.e., blood, muscle, and heart (Fig. 1E)—are, con- 
sequently, those with less power to detect eQTLs 
(6). Because most eQTL analyses are performed 
on easily accessible samples, such as blood, this 
highlights the relevance of the GTEx multitissue 
approach. 

Although thousands of genes are differentially 
expressed between tissues (fig. S8) or show tissue- 
preferential expression (fig. S9 and table S5), 
fewer than 200 genes are expressed exclusively 
in a given tissue (figs. S10 and S11 and tables S6 
and S7, A to E). The vast majority (~ 95%) are 
exclusive to testis and many are IncRNAs. This 
may reflect low-level basal transcription com- 
mon to all cell types or result from general tissue 
heterogeneity, with few primary cell types being 
specific to a given tissue. 

Expression of repetitive elements also reca- 
pitulates tissue type (table S8 and fig. S12A). 
We identified 3046 PCGs whose expression, in 
at least one tissue, was correlated with the ex- 
pression of the closest repeat element (on aver- 
age 2827 base pairs away, fig. S12B). In about 
half of these cases, the repeat was also signif- 
icantly coexpressed with other repeats of its 
same family (table S8 and fig. S13). LacRNA 
expression can be regulated by specific repeat 
families (15), and we found evidence that testis- 
specific expression could be regulated by endog- 


enous retrovirus L repeats (ERVL and ERVL- 
MaLR) (fig. S12C). 

Using linear mixed models, we found that 
variation in gene expression is far greater among 
tissues (47% of total variance in gene expression) 
than among individuals (4% of total variance, 
Fig. 2A and table S9), and very similar for PCGs 
and IncRNAs when controlling for gene expres- 
sion (Fig. 2A). Genes that show high expression 
variance across individuals and low variance 
across tissues include genes on the sex chromo- 
somes, as well as autosomal genes, such as the 
RHD gene that determines Rh blood group. 

We identified 92 PCGs and 43 IncRNAs with 
global sex-biased expression [false discovery rate 
(FDR) < 0.05, Fig. 2B and table S10]. Genes over- 
expressed in males are predominantly located 
on the Y chromosome. Conversely, many genes 
on the X chromosome are overexpressed in fe- 
males, suggesting that more genes might es- 
cape X inactivation than previously described 
(16). Among these, we found XIST and JPX, 
known to participate in X inactivation, as well 
as the IncRNAs RP11-309M23.1 and RP13-216E22.4, 
the expression of which shows enrichment in 
the nucleus in female cell lines from ENCODE 
(17) and hence could be candidates to also 
participate in X inactivation (fig. S14) (J6). 
Among autosomal PCGs, MMP3, linked to sus- 
ceptibility to coronary heart disease [Online 
Mendelian Inheritance in Man (OMIM) no. 
614466] and more prevalent in males, shows 
the strongest expression bias (Fig. 2B). 

We detected 221 PCGs and 153 IncRNAs glob- 
ally differentially expressed between individu- 
als of European and African-American ancestry 
(FDR < 0.05, Fig. 2C and table S11). There is a 
slight enrichment of IncRNAs (P < 1 x 10°), 
among which we identified the RP11-302J23.1 
gene, highly expressed in cardiac tissue in Af- 
rican Americans only, and located in a region 
that harbors weak associations to heart disease 
(18). Additionally, some genes showing differ- 
ential expression by ethnicity lie in genomic re- 
gions under positive selection in European or 
sub-Saharan African populations (Fig. 2C and 
fig. S15). 

Finally, we detected 1993 genes that globally 
change expression with age (FDR < 0.05, Fig. 2D 
and table S12). Genes that decrease expression 
are enriched in functions and pathways related 
to neurodegenerative diseases such as Parkinson’s 
and Alzheimer’s diseases, among which eight har- 
bor single-nucleotide polymorphisms (SNPs) 
for these diseases identified from genome-wide 
association studies (P < 0.05). Among the genes 
that increase expression with age is EDA2R, whose 
ligand, EDA, has been associated with age-related 
phenotypes (19). 

We also identified 753 genes with tissue spe- 
cific sex-biased expression (FDR < 0.05, table S13) 
predominantly in breast tissue (92%), and 31 
genes with tissue-specific ethnicity-biased expres- 
sion, many in the skin (FDR < 0.05, Table 1 and 
table S14). Among the sex-differentially expressed 
genes, five show biased expression specifically 
in heart and are of interest given the differing 
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prevalence of cardiovascular disease between 
males and females. One of these genes, PLEKHA7 
(fig. S15C), contains SNPs associated with risk 
for cardiovascular disease. 

Overall, tissue specificity is likely to be driven 
by the concerted expression of multiple genes. 
Thus, we performed sex-based differential anal- 


ysis of coexpression networks. We identified 
42 coexpression modules in males and 46 in 
females (fig. S16). Among male-specific mod- 
ules, we found one related to spermatid differ- 
entiation and development (FDR = 9.0 x 10°“, 
fig. SI6B), and among female-specific modules, 
we found one related to epidermis and ecto- 


derm development (FDR = 4.6 x 10, fig. 
S16C). Differential network expression, there- 
fore, distinguishes differences between male 
and females not well captured by analysis of 
individual genes. 

Split-mapped RNA-seq reads predict about 
87,000 novel junctions with very strong support 
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Fig. 1. The GTEx multitissue transcriptome. (A) Gene expression levels 
and number of tissues in which genes are expressed (>0.1 RPKM in at 
least 80% of the samples). RPKMs are averaged over all genes expressed 
in a given number of tissues. (B) Sample and tissue similarity on the 
basis of gene expression profiles. Left: Multidimensional scaling Right: 
Tissue hierarchical clustering. (©) Expression values from eight GTEx 
tissues (colored circles) plotted radially along seven metagenes ex- 
tracted from expression data. Antemortem samples curated from the 
Gene Expression Omnibus (GEO) cluster strongly with GTEx tissues. (D) 
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Transcriptome complexity. Bottom: Cumulative distribution of the av- 
erage fraction of total transcription contributed by genes when sorted from 
most-to-least expressed in each tissue (x axis). Lines represent mean values 
across samples of the same tissue, and lighter-color surfaces around the 
mean represent dispersion calculated as the standard deviation divided by 
the cumulative sum of all means. Top: Biological type and relative contribution 
to total transcription of the hundred most expressed genes. Height of the 
bars is proportional to the fraction that these genes contribute to total 
transcription. 
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(fig. S17). These tend to be more tissue spe- 
cific, detected in fewer samples, and less con- 
served than previously annotated junctions 
(only 2.6% of novel junctions can be detected 
as orthologous in mouse, compared to 65% for 
annotated junctions). Multidimensional scal- 
ing based on exon inclusion levels again large- 
ly recapitulates tissue type (Fig. 3A). However, 
samples from brain cluster as the primary out- 
group, supporting the existence of a distinct 


splicing program in the brain (20). Furthermore, 
preferential gene expression of RNA-binding pro- 
teins and both differential and preferential 
exon inclusion are enriched in the brain (figs. 
$18 and S19 and table S15). We found very few 
exons exclusively included or excluded in a given 
tissue (fig. S20 and table S16), 40% of which 
show exclusive inclusion in the brain. We also 
found that micro-exons (<15 bp) are overwhelm- 
ingly used in the brain compared to other tis- 


sues (Wilcoxon test, P < 1 x 10~’, Fig. 3B). This 
pattern is not obvious in short exons longer than 
15 bp (P = 0.3, fig. S21). This observed brain- 
specific splicing pattern may result from differen- 
tial splicing in the cerebellum, because expression 
clustering of the brain regions reveals a general 
up-regulation of RNA-binding proteins specifi- 
cally in the cerebellum (Fig. 3C). This is also the 
brain region exhibiting the largest proportion of 
novel splicing events (fig. S22). 
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Fig. 2. Gene expression across tissues and individuals. (A) Left: Con- 
tribution of tissue and individual to gene expression variation of PCGs 
and IncRNAs. Bottom right: Mean + SD over all genes (filled circles) and 
over genes with similar expression levels in PCGs and IncRNAs (unfilled 
circles). Circle size is proportional to the sum of tissue and individual 
variation, and segment length corresponds to 0.5 SD. Top right: genes 
with high individual variation and low tissue variation. (B) Sex differen- 
tially expressed genes. Top: differentially expressed genes (FDR < 0.05) 
sorted according to expression differences between males and females. 
Genes in the Y chromosome are sorted according to the expression in 
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males. Bottom: MMP3 gene expression in males and females. (©) Genes 
differentially expressed with ethnicity. Top: differentially expressed genes 
(FDR < 0.05) between African Americans (AA) and European Americans 
(EA) sorted according to expression differences. A few of these genes lie 
in regions reported to be under positive selection in similar populations. 
Bottom: expression of RP11-302J23.1. (D) Genes differentially expressed 
with age. Top: Genes sorted according to the regression coefficient. 
Bottom: expression of EDAR2 gene in nerve and artery as a function of 
age. Shaded area around the regression line represents 95% confidence 
interval. 
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Fig. 3. Splicing across tissue and individuals. (A) Multidimensional scaling 
of all samples on the basis of exon inclusion levels (Percent spliced in, PSI). 
(B) Microexon inclusion across tissues. Values of tissue exon inclusion close 
to 1 (-1) indicate that the microexon is included (excluded), in nearly all 
samples from the tissue, and excluded (included) in nearly all samples from 
the rest of the tissues. Tissues are sorted according to tissue exon inclusion 
(phi) median value. (©) Clustering of brain samples on the basis of the 
normalized expression levels of 67 RNA binding proteins involved in splicing. 
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The order of samples and genes is obtained by biclustering the expression 
matrix. (D) Left: Contribution of tissue and individual to splicing variation in 
PCGs. Bottom right: Mean + SD of individual and tissue contributions to 
splicing and to gene expression variation. Circle size is proportional to the 
sum of tissue and individual variation and segment length corresponds to 
0.5 SD. Top right: Genes with high splicing variation across individuals. (E) 
Contribution of gene expression to the between-individual and between- 
tissue variation in isoform abundance 
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Table 1. Genes with sex-biased and ethnicity-biased expression in GTEx tissues. Differentially expressed genes between males (M) and females (F) and 
between African American (AA) and European American (EA) in those tissues with at least 10 samples per group. Median fold change (on autosomal genes) was 
calculated for tissues with more than two significant genes. 


No. of 
samples 
MF All 
genes 
Adipose 76 37 = 45 29 
Artery 88 57 40 21 
Blood 99 57 20 12 
Breast 14 13 762 567 
Esophagus 27 il 23 14 
Heart 74 34 27 15 
LCL 26 13. 23 ll 
Lung 76 43 34 17 
uscle 87 51 42 27 
Nerve 54 34 38 24 
Skin 76 43 47 32 
Thyroid 65 40 42 25 
0.ofsamples 942 566 
o. of individuals 111 64 847 628 


In contrast to gene expression, variation of 
splicing, measured either from relative iso- 
form abundance or exon inclusion, is similar 
across tissues and across individuals, but ex- 
hibits a much larger proportion of residual un- 
explained variation (Fig. 3D, fig. $23, and table 
$17). This could arise from nonadditive inter- 
actions between individuals and tissues, but 
might also reflect stochastic, nonfunctional fluc- 
tuations that are more common in splicing than 
in expression (27). Among the genes that show 
high interindividual splicing variability, we found 
an enrichment of ribosomal proteins and genes 
related to translation and protein biosynthe- 
sis (Fig. 3D and table S18). Higher variability 
between individuals may also partially reflect 
an effect of ischemic time on splicing, which 
we observed when clustering samples by exon 
inclusion within each tissue (fig. $24). 

The abundance of splicing isoforms reflects 
the actions of both primary transcription and 
posttranscriptional processing—mostly alter- 
native splicing. To determine the relative con- 
tribution of each process, we estimated the 
proportion of variance in isoform abundance 
that can be simply explained by variance in 
gene expression. We found that gene expres- 
sion explains only 45% of the variance between 
individuals, but 84% of the variance between 
tissues (Fig. 3E and fig. S25). This strongly 
suggests that primary transcription is the main 
driver of cellular specificity, with splicing play- 
ing a complementary role. Although this may 
be unexpected, given the magnitude of the ef- 
fect, it is consistent with recent findings of low 
proteomic support for alternatively spliced iso- 
forms (22) and few shifts in major protein iso- 
forms across cell types (table S19) (23). 
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PCGs IncRNA 


Sex 


No. of 
genes 


Y Pseudo- 


chrom. chrom. 


region 
16 28 4 1 12 
19 29 2 O 9 
8 18 1 0 1 
195 26 21 0 715 
9 22 1 O O 
12 23 1 O 3 
12 21 1 fe) 
17 31 1 0 2 
15 24 6 0 12 
14 25 5 7 
15 31 5 O ll 
17 27 5 9 
219 Al 34 771 


Overall, our results underscore the value 
of monitoring the transcriptome of multiple 
tissues and individuals in order to understand 
tissue-specific transcriptional regulation and 
to uncover the transcriptional determinants 
of human phenotypic variation and disease 
susceptibility. 
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Accurate prediction of the functional effect of genetic variation is critical for clinical 
genome interpretation. We systematically characterized the transcriptome effects of 
protein-truncating variants, a class of variants expected to have profound effects on 
gene function, using data from the Genotype-Tissue Expression (GTEx) and Geuvadis 
projects. We quantitated tissue-specific and positional effects on nonsense-mediated 
transcript decay and present an improved predictive model for this decay. We directly 
measured the effect of variants both proximal and distal to splice junctions. Furthermore, 
we found that robustness to heterozygous gene inactivation is not due to dosage 
compensation. Our results illustrate the value of transcriptome data in the functional 


interpretation of genetic variants. 


enetic variants predicted to shorten the 
coding sequence of genes—termed protein- 
truncating variants (PTVs)—are typically 
expected to have large effects on gene 
function. These variants are enriched for 
disease-causing mutations (J, 2), but some may be 
protective against disease (3). However, PTVs are 
abundant in the genomes of healthy individuals (4), 


indicating that they often do not have major 
phenotypic consequences. In addition, although 
PTVs are often described as loss-of-function (LOF) 
variants, in most cases their precise molecular 
effect has not been characterized and in other 
cases show gain-of-function effects (1). Clinical 
interpretation of PTVs will thus require direct 
characterization of their biochemical effects. 


We cataloged predicted PTVs and their tran- 
scriptomic effect in 462 healthy individuals with 
DNA and mRNA sequencing (RNA-seq) from 
lymphoblastoid cell lines (LCLs) in the Geuvadis 
study (5, 6) and 173 individuals with exome se- 
quencing and RNA-seq from a total of 1634: samples 
from multiple tissues in the Genotype-Tissue 
Expression (GTEx) study [supplementary ma- 
terials section S1 (SM S1)] (7, 8). Each GTEx 
individual has RNA-seq data from 1 to 30 tis- 
sues, with 9 tissues having >80 samples. We de- 
fined PTVs (4) (table S1) as single-nucleotide 


lWellcome Trust Centre for Human Genetics, Nuffield 
Department of Clinical Medicine, University of Oxford, 
Oxford, UK. “Institute for Molecular Medicine Finland (FIMM), 
University of Helsinki, Helsinki, Finland. "Washington 
University in St. Louis, St. Louis, MO, USA. “Broad Institute 
of MIT and Harvard, Cambridge, MA, USA. °Analytical and 
Translational Genetics Unit, Massachusetts General Hospital, 
Boston, MA, USA. ®Department of Genetics, Stanford 
University, Stanford, CA, USA. Department of Pathology, 
Stanford University, Stanford, CA, USA. °Biomedical 
nformatics Program, Stanford University, Stanford, CA, USA. 
°Department of Psychiatry, Mt. Sinai Hospital, NY, USA. 
10Department of Genetic Medicine and Development, 
University of Geneva, Geneva, Switzerland. “Institute for 
Genetics and Genomics in Geneva (iGE3), University of 
Geneva, Geneva, Switzerland. Swiss Institute of 
Bioinformatics, Geneva, Switzerland. ‘Division of Psychiatric 
Genomics, Department of Psychiatry, Icahn School of 
Medicine at Mount Sinai, NY, USA. “Institute of Medical 
Genetics, School of Medicine, Cardiff University, Heath Park, 
Cardiff, UK. Center for Genomic Regulation (CRG), 
Universitat Pompeu Fabra (UPF), Barcelona, Catalonia, 
Spain. “Department of Statistics, University of Oxford, 
Oxford, UK. ‘/National Institute for Scientific Computing 
(LNCC), Petropolis, Rio de Janeiro, Brazil. “Oxford Center 
for Diabetes Endocrinology and Metabolism, University of 
Oxford, Oxford, UK. *?New York Genome Center, New York, 
NY, USA. Department of Systems Biology, Columbia 
University, New York, NY, USA. “Department of Medicine, 
Harvard Medical School, Boston, MA, USA. 

*Corresponding author. E-mail: rivas@well.ox.ac.uk (M.A.R.); 
tlappalainen@nygenome.org (T.L.); macarthur@atgu.mgh. 
harvard.edu (D.G.M.) {These authors contributed equally to 
this work. {These authors contributed equally to this work. 


STUDY 
COHORT 
© GTEx 
173 individuals 
1634 samples 


exome sequencing 
multi-tissue RNA-seq 


@® Geuvadis 
462 individuals 
low-pass whole 
genome sequencing 
(1000 Genomes) 
LCL RNA-seq 


Fig. 1. Schematic overview of the study. We prepared an integrated DNA and RNA sequencing data se 
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variants (SNVs) predicted to introduce a pre- 
mature stop codon or to disrupt a splice site, 
small insertions or deletions (indels) pre- 
dicted to disrupt a transcript’s reading frame, 
and larger deletions that remove the full pro- 
tein coding sequence (CDS) (SM 82, Fig. 1, and 
figs. S1 and S2). We identified 13,182 candidate 
PTVs using phase 1 data of the 1000 Genomes 
Project (9) of the 421 individuals included in 
the Geuvadis RNA-seq project, as well as 
4584 candidate PTVs in the GTEx data, for a 
combined total of 16,286 candidate variants 
(table S2). 

We measured total gene expression levels in 
reads per kilobase of exon per million mapped 
reads, allele-specific expression (ASE) detecting 
different expression levels of two haplotypes of 
an individual, and split mappings across anno- 
tated exon junctions to quantify splicing (SM S3 
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and S4). Transcripts containing common PTVs 
are more weakly expressed and more tissue-specific 
than transcripts that do not contain common 
PTVs (SM S65 and figs. S3 to $7), consistent with 
previous work (4). 

PTVs that generate premature stop codons 
may trigger nonsense-mediated decay (NMD). 
Such variants are often recessive and may pro- 
tect against detrimental phenotypic effects but 
also may cause disease via haploinsufficiency 
(1). Variants that escape NMD may create a trun- 
cated protein with dominant-negative or gain- 
of-function effects (1). We compared transcript 
levels between the PTV and the non-PTV al- 
leles within the same individual (SM S6) (4, 5, 10) 
for a total of 1814 PTVs (SM S6, figs. S8 to S12, 
and table S3) and validated the allelic ratios 
obtained from RNA-seq data (figs. S13 to S18 
and table S4) (77). We also generated a method 
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to assess the ASE effect of frameshift indels (SM 
S6 and figs. S8 to S12), which were not previ- 
ously examined (5, 10) due to the technical chal- 
lenges of mapping bias (12-14). 

Allelic count data were analyzed with a Bayesian 
statistical method to address whether a variant 
exhibits ASE in a given tissue and whether this 
signal is shared across multiple tissues of the 
same individual (SM $7 and figs. S19 to S26) (15). 
We observe a higher proportion of strong or mod- 
erate allelic imbalance in rare and singleton non- 
sense SNVs compared with common nonsense 
variants (54.3%, 55.4%, and 35.7%, respectively), 
suggesting that rare PTVs are more likely to trig- 
ger NMD (fig. S19). 

Rare nonsense SNVs predicted to trigger NMD 
according to the 50-bp (base pair) rule (SM 
S7) (16) have a larger proportion of ASE than 
SNVs that escape NMD (69.5% versus 31.9%, 
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Fig. 2. Allele-specific expression analysis. (A) Proportion of rare SNVs with 
allele-specific expression (ASE) for synonymous variants (n = 25,233) and 
nonsense variants predicted to escape (n = 158) or trigger (n = 287) NMD. 
(B) Proportion of rare indels with ASE for in-frame (n = 355) and frameshift 
indel variants predicted to escape (n = 77) or trigger (n = 129) NMD. Due to 
different quality filters, the proportions are not directly comparable to those 
in (A). (C) Receiver operating characteristic curve for predicting NMD with 
binary classification defined as no ASE (escape) and moderate, strong, or 
heterogeneous ASE (trigger). The filled circles show the specificity and 
sensitivity for NMD prediction with alternative simple distance rules (inset). 
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(D) Multitissue ASE classification for rare nonsense variants predicted to trigger 
NMD (n = 287). (E) Example of ASE data across six tissues for a heterozygous 
carrier of the nonsense variant rs149244943 in gene PHKB (phosphorylase 
kinase, beta) classified as having heterogeneous ASE effects across the six 
tissues. We confirmed that this effect is not driven by a common tissue-specific 
expression quantitative trait locus. (F) Example of ASE data across 16 tissues 
for a heterozygous carrier of the nonsense variant rs119455955, a disease 
mutation for recessive late-infantile neuronal ceroid lipofuscinosis in gene TPP1 
(tripeptidy! peptidase 1), classified as having moderate ASE across all tissues. 
For all plots, 95% Cls are shown. 


8 MAY 2015 * VOL 348 ISSUE 6235 667 


RESEARCH | REPORTS 


oa ° 
fo) (oe) 


2 
ns 


Proportion of variants 
disrupting splicing: (2) 


i 
ip 
* 


‘ all 


| |i 


fl 


O Unknown @ Exon skipping © Exon elongation © Mixture 


DONOR $PLICE SITE 


eae 1 


ACCEPTOR SPLICE SITE 


Distance to © 86 2 DiAvane 2 et RA cain e we 

D 0.5- ' ' 
Sel am Ha 
a | | | : I | 

ep 2) ATLANTA, errrprseememeererecmm ll DTT 
— | | 

i : conceit 
H 1 cecercrelMecrefianecoee ee fre 


Fig. 3. Splicing disruption. (A) Proportion of variants disrupting splicing at 
each distance +25 bp from donor and acceptor site (*P < 0.05, **P < 0.01, 
***P < 0.001; green for P < 0.05; upper limit of 95% Cl is shown; P value 
evaluated using the estimated proportion of variants supporting the alter- 
native distribution times the effect size of the alternative distribution). (B) 
Classification of splice disruption events: exon skipping (low exon quantifica- 
tion value, no effect on intron quantification), exon elongation (high intron 
quantification value, no effect on exon quantification), and mixture (high 


intron and low exon quantification values). (C) Diagram of donor and acceptor 
splice junctions and sequence logo of represented sequences. (D) Effect size 
estimates (in standard deviations from the population distribution; 95% Cl is 
shown) of the variants on splice junction quantification value. (E) Median 
genomic evolutionary rate profiling score (GERP) of all variants. (F) Distribu- 
tion of common variants identified in an independent exome sequencing 
study of 4500 Swedish individuals. (G) Distribution of reported disease- 
causing variants in ClinVar. 


respectively), and both classes demonstrate ASE 
more often than synonymous variants (7.9%, 
P < 0.001 across all comparisons, two-proportion 
& test) (Fig. 2A). A higher proportion of ASE is 
also observed for frameshift indels predicted to 
trigger NMD (52.1%) compared with those pre- 
dicted to escape NMD (30.6%) and at higher 
levels than that predicted for in-frame indels 
(18.4%) (Fig. 2B). Testing alternative simple dis- 
tance rules showed that the 50-bp rule has the 
highest predictive value (Fig. 2C). 

We next generated an improved predictive 
model for no ASE versus strong/moderate ASE 
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for all nonsense SNVs (SM S7). Our model pre- 
dicts NMD better than the 50-bp rule, with an 
area under the curve (AUC) of 80.8% [95% con- 
fidence interval (CI) 77.3 to 84.4%] compared 
to a 50-bp rule AUC of 72.9% (69.3 to 76.5% 
Cl) (Fig. 2C and figs. S21 and $22). Our results 
provide a quantitative estimate of the value of 
NMD predictions and illustrate that the 50-bp 
rule (16) remains a valuable heuristic. None- 
theless, our model improves NMD prediction, 
allows a more flexible analysis of the proba- 
bility that a variant will trigger NMD from variant 
data (fig. S21), and provides data for under- 


standing the molecular mechanisms of NUD 
(fig. S22). 

The GTEx study design allows us to study 
variation in NMD across tissues. We applied a 
Bayesian hierarchical model (SM $7) (15) to 
rare nonsense variants predicted to trigger 
NMD, according to the 50-bp rule, with ASE 
data from at least two tissues. We estimate 
that 30.5% of these nonsense variants have no 
ASE in any tissue, and 48.3% and 3.3% have 
moderate or strong ASE across all tissues, re- 
spectively. Finally, 17.9% have heterogeneous 
effects across tissues, and 8.1% of ASE effects 
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are specific to a single tissue (Fig. 2, D to F, 
and figs. S23 to S26). The tissue specificity of 
NMD implies that the same PTV may have dif- 
ferent effects across tissues, which could contrib- 
ute to tissue-specific effects of disease-causing 
mutations (/7). 

We examined whether heterozygous car- 
riers of PTVs exhibit compensatory up-regulation 
of the functional allele, which could contribute 
to tolerance of PTVs and partially explain the 
widespread haplosufficiency of human genes 
(18). Dosage compensation has been reported 
to correlate with gene expression levels (19) 
and occur in over 80% of deleted genes in Dro- 
sophila melanogaster (20). To minimize the ef- 
fect of genotyping error, we focused only on 
biallelic whole-gene deletions with strong ex- 
perimental support and manual curation (SM 
82 and figs. S27 to S29). We first analyzed the 
few examples of common whole-gene deletion 
polymorphisms (SM S8). For 5/6 of these genes, 
an additive model relating gene expression to 
gene copy number provided a better fit than 
a dominant model, providing no evidence for 
dosage compensation (table S6). Addition- 
ally, heterozygous carriers of rare deletions 
also had consistently decreased expression of 
the respective gene compared with the pop- 
ulation median (P = 1.37 x 10°, one-sided bi- 
nomial test of 11 rare PTV deletions in 25 
genes) (figs. S27 and S28). Similar results were 
obtained for 53 nonsense PTVs with strong 
ASE signals (P = 2.90 x 10°°, one-sided bi- 
nomial test) (figs. S30 and S31). These results 
suggest that full dosage compensation is rare 
for human genes. 

Disruption of splicing can result in changes 
in protein structure either via in-frame changes 
in exon structure or by introducing a prema- 
ture stop codon (27). Splicing variant anno- 
tation tools typically focus only on the two 
bases at either end of a spliced intron, “essential 
splice sites” (22), despite the fact that more 
distant sites are also known to affect splicing 
(21, 23, 24). 

Variation around splice junctions tends to 
be rare (minor allele frequencies < 0.01). We 
standardized the population distribution of 
each splice-junction quantification per tissue 
and grouped variants by their distance from 
their respective donor and acceptor sites. We 
then analyzed whether individuals carrying 
variants in these positions differ from the 
population in the quantification of the splice 
junction and the proximal exon and intron 
(Fig. 3, A to D). 

In the Geuvadis data set, up to 79% of var- 
iants in the four essential splice-site loci cause 
splice disruptions (P < 0.01, Fig. 3A; GTEx 
results, figs. S33 to S37). We also find evidence 
of splice disruption from variants outside these 
regions, especially at positions 1 to 5 bp down- 
stream of intronic donor splice sites, 1 bp into 
the adjacent exon, and also more distally— 
including the -24 position from the acceptor 
site, which likely reflects the branch-point po- 
sition required for pre-mRNA splicing (25). 
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These patterns are consistent with other es- 
timates of functional effects (Fig. 3E), deple- 
tion of common variants in exome sequencing 
data sets (Fig. 3F) (26), and a higher preva- 
lence of disease-causing mutations (Fig. 3G). 
Analyses of common variants did not cap- 
ture these patterns of enrichment (table S7, figs. 
$38 to S40, and SM SQ). Our posterior prob- 
ability estimates for sites with significant al- 
ternative distributions (P < 0.05) provide a 
resource for analyses (figs. S41 and 42 and SM 
S9 and S10). 

By drawing on data from a wide range of adult 
tissues across 635 individuals, we provide a sys- 
tematic assessment of the effect of predicted PTVs 
on the human transcriptome. Furthermore, this 
study indicates that nonsense-mediated decay 
has heterogeneous effects across tissues and 
also shows how to better detect splice-disrupting 
variants outside the “essential” sites at the splice 
junction. 

We find no evidence for widespread dosage 
compensation maintaining normal expression lev- 
els of genes affected by heterozygous PTVs. This, 
together with the fact that most human genes 
are haplosufficient (18), suggests that homeostatic 
mechanisms at the cellular level, possibly as pro- 
posed in the theory of dominance (27), maintain 
biological function in the face of heterozygous, 
or even homozygous (4), inactivation of hu- 
man genes. 

The resource made available with this study 
provides a starting point for cataloging var- 
iants affecting protein function, but larger data 
sets will be required to increase our power to 
predict molecular consequences of variants 
from sequence data alone. These results high- 
light the benefits of direct RNA sequencing of 
either patient tissue or genetically engineered 
cell lines for interpretation of genetic varia- 
tion and suggest that personal transcriptom- 
ics will become an important complement to 
genome analysis. 
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SUPERNOVAE 


“Ti gamma-ray emission lines 
from SN1987A reveal an 
asymmetric explosion 


S. E. Boggs,'* F. A. Harrison,” H. Miyasaka,” B. W. Grefenstette,” 

A. Zoglauer,' C. L. Fryer,” S. P. Reynolds,* D. M. Alexander,” H. An,° D. Barret,”® 
F. E. Christensen,? W. W. Craig,”’° K. Forster,” P. Giommi,” C. J. Hailey,” 

A. Hornstrup,” T. Kitaguchi,”* J. E. Koglin,’* K. K. Madsen,” P. H. Mao,” K. Mori,” 
M. Perri,’ M. J. Pivovaroff,'° S. Puccetti,” V. Rana,” D. Stern,’® 


N. J. Westergaard,? W. W. Zhang?” 


In core-collapse supernovae, titanium-44 (“Ti) is produced in the innermost ejecta, 
in the layer of material directly on top of the newly formed compact object. As such, 
it provides a direct probe of the supernova engine. Observations of supernova 1987A 
(SN1987A) have resolved the 67.87- and 78.32-kilo—electron volt emission lines from 
decay of “4Ti produced in the supernova explosion. These lines are narrow and 
redshifted with a Doppler velocity of ~700 kilometers per second, direct evidence 


of large-scale asymmetry in the explosion. 


upernova 1987A (SN1987A) in the Large 
Magellanic Cloud provides a unique op- 
portunity to study a nearby (50 kpc) core 
collapse supernova (CCSN) explosion and 
its subsequent evolution into a supernova 
remnant. SN1987A has validated some of our 
most basic assumptions about CCSN. The burst 
of neutrinos observed on Earth that preceded the 
visible emission confirmed that the overall ex- 
plosion is driven by the collapse of the central 
core to a neutron star (J, 2). The direct gamma- 
ray detections of 0.07 solar mass (Mg) of °°Co 
(3, 4 and 0.003 Mz of *"Co (5), and the cor- 
relation between the exponential decay of the 
optical light curve and lifetime of these isotopes 


1Space Sciences Laboratory, University of California, 
Berkeley, CA 94720, USA. “Cahill Center for Astronomy and 
Astrophysics, California Institute of Technology, Pasadena, 
CA 91125, USA. 3CCS-2, Los Alamos National Laboratory, Los 
Alamos, NM 87545, USA. “Physics Department, NC State 
University, Raleigh, NC 27695, USA. “Department of Physics, 
Durham University, Durham DH1 3LE, UK. °Department of 
Physics, McGill University, Rutherford Physics Building, 
Montreal, Quebec H3A 278, Canada. ‘Université de Toulouse, 
UPS-OMP, IRAP, Toulouse, France. ®°CNRS, Institut de 
Recherche en Astrophysique et Planétologie, 9 Av. colonel 
Roche, BP 44346, F-31028 Toulouse Cedex 4, France. °DTU 
Space, National Space Institute, Technical University of 
Denmark, Elektrovej 327, DK-2800 Lyngby, Denmark. 

101 awrence Livermore National Laboratory, Livermore, CA 
94550, USA. “Agenzia Spaziale Italiana (ASI) Science Data 
Center, Via del Politecnico snc |-00133, Roma, Italy. 
Columbia Astrophysics Laboratory, Columbia University, 
New York, NY 10027, USA. RIKEN Nishina Center, 2-1 
Hirosawa, Wako, Saitama, 351-0198, Japan. “Kavli Institute 
for Particle Astrophysics and Cosmology, SLAC National 
Accelerator Laboratory, Menlo Park, CA 94025, USA. 

ISINAF - Osservatorio Astronomico di Roma, via di Frascati 
33, |-00040 Monteporzio, Italy. “Jet Propulsion Laboratory, 
California Institute of Technology, Pasadena, CA 91109, USA. 
NASA Goddard Space Flight Center, Greenbelt, 

MD 20771, USA. 

*Corresponding author. E-mail: boggs@berkeley.edu 


670 8 MAY 2015 » VOL 348 ISSUE 6235 


(6, 7), confirmed that the light curve is driven by 
radioactive decay of these nuclei. 

SN1987A, however, held a few surprises. Models 
and light curves supported red supergiant pro- 
genitors for CCSN, but the SN1987A progenitor 
was blue (8). The °°Co gamma-ray lines emerged 
from the explosion several months before ex- 
pected, implying that some of the innermost 
heavy nuclear products had been mixed into 
the outer envelope (3). The °°Co gamma-ray 
line spectroscopy showed mixing to velocities of 
~3000 km s“|, several times higher than expected 
from spherically symmetric models, and also a 
net redshift of ~500 km s~’, which indicated 
large-scale asymmetry (9, 10). Although subsequent 
x-ray observations have revealed expanding, 
brightening ejecta, there has been no evidence 
of the compact object created in the explosion 
(11, 12). 

Here we present observations of “Ti emission 
from SN1987A with the Nuclear Spectroscopic 
Telescope Array (NuSTAR) focusing high-energy 
x-ray telescope (13). “Ti production occurs deep 
inside the supernova near the dividing line 
between ejecta and material that falls back on 
the compact object, and is sensitive to the CCSN 
explosion energy and asymmetries (/4, 15). The 
decay of “*Ti (85.0 years) results in the produc- 
tion of photons at 67.87 keV (93.0% of decays) 
and 78.32 keV (96.4% of decays) (16). NuSTAR 
observed SN1987A for multiple epochs between 
September 2012 and July 2014 with a total ex- 
posure of 2596 ks, at an average time of ~26.6 
years after explosion. A full list of the observa- 
tions is provided in table S1. Both the 67.87-keV 
line and the 78.32-keV lines of “*Ti are clearly 
seen in the NuSTAR spectrum (Fig. 1). The lines 
are well fit with a simple Gaussian line shape 
plus the underlying continuum, demonstrated 
through the chi-square test. All uncertainties 


are quoted here at the 90% confidence level 
unless otherwise stated. In the combined anal- 
ysis, we measure a 67.87-keV line flux of 3.5707 x 
10-*photons cm™ s~’, corresponding to a “Ti 
yield of 1.5*)? x 10-*M,. This derived yield 
assumes the decay rate measured in the lab- 
oratory and nearly neutral “*Ti. The NuSTAR 
optics response cuts off at 78.39 keV, poten- 
tially cutting off any blueshifted emission. How- 
ever, examination of the 67.87-keV line reveals 
a net redshift, allowing us to combine the lines 
for optimal analysis and use the individual lines 
to check for consistency (17). The signal-to-noise 
ratio for the detection of these combined lines 
is ~8.50. The line widths are consistent with 
the NuSTAR spectral resolution, and the corre- 
sponding upper limit on any Doppler broad- 
ening is 4100 km s” full width at half-maximum 
(FWHM). The 67.87-keV line centroid is red- 
shifted by 0.231909 keV. Corrections due to 
recession velocity and the look-back effect (17) 
are < 340 km s combined. Taking these cor- 
rections into account, the redshift velocity of 
the “Ti lines in the rest frame of SN1987A is 
700740 km s7). 

The NuSTAR yield can be compared to pre- 
vious measurements, indirect estimates, and theo- 
retical predictions. The International Gamma Ray 
Astrophysics Laboratory (INTEGRAL) observa- 
tory reported first detection of the “Ti 67.87-keV 
and 78.32-keV lines from SN1987A (18). INTEGRAL 
could not spectrally resolve these lines, but mea- 
sured a combined flux for both lines of (1.7 + 0.4) x 
10° photons cm~ s~}, corresponding to an initial 
mass of synthesized “477 of (3.1+ 0.8) x 10* Mp. 
The yield measured by NuSTAR is inconsistent 
with the high yield found by INTEGRAL. Chandra 
X-ray Observatory Advanced CCD Imaging Spec- 
trometer (ACIS) limits on Sc Ka emission have 
been used to set an upper limit on the “*Ti mass 
of <2 x 10 * Moz (19), consistent with the NuSTAR 
yield. Between 1994 and 2001, the ultraviolet, 
optical, and infrared (UVOIR) emission was dom- 
inated by the radioactive decay of “Ti, which led 
to a number of estimates of the “Ti yield based on 
detailed modeling of the UVOIR spectra (20-22) 
and bolometric luminosity (23, 24). These obser- 
vational studies do not all agree within their 
uncertainties, but generally fall in the range of (0.5 
to 2) x 10~* Mg of “Ti produced in the explosion 
(24). The “Ti yield measured by NuSTAR is in 
good agreement with most estimates based on the 
UVOIR bolometric light curves and spectroscopic 
modeling, providing support for the detailed 
models underlying these estimates; however, our 
measured redshift reveals a more complicated 
explosion structure than assumed in these models. 
Finally, theoretical predictions of the “Ti yield 
for SN1987A fall roughly within this same range 
of (0.5 to 2) x 10“ Mo, with lower yields gen- 
erally corresponding to spherically symmetric mod- 
els, and yields increasing with larger asymmetries. 
See reference (24) for a compilation of theory 
references. 

NuSTAR observations have set an upper limit 
on the Doppler broadening of <4100 km s7? 
FWHM, consistent with the widths of ~3000 km s* 
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Fig. 1. SN1987A 55- to 80-keV 
background-subtracted spectrum 
measured with NuSTAR. Data from 
both telescopes are combined (for 
presentation only) and shown with 

lo error bars. Both of the *4Ti lines are 
clearly measured. The vertical green 
lines are the rest-frame energies of the 
“477 lines (67.87 and 78.32 keV). The 
redshift is evident in both lines, indi- 
cating the asymmetry of the explosion. 
Also shown is the best-fit model, con- 
volved through the NuSTAR instrument 
response, for case (1), where the fitting 
parameters for the two lines are tied 
together (supplementary materials). 


Cts sec! keV"! 


originally measured for the *°Co lines (9, 10). 
Although the limit on broadening itself is not 
surprising, the measured redshift is both statis- 
tically significant and large compared to the up- 
per limit on Doppler broadening, indicative of 
an asymmetric ejection of “Ti in the initial ex- 
plosion. The *°Co gamma-ray lines also showed 
redshifts (~500 km s~), but the significance was 
marginal. The *°Co detection also stands con- 
trary to predictions of spherically symmetric 
explosion models that would produce blue- 
shifted gamma-ray lines due to increased absorp- 
tion of the receding redshifted emission. The 
redshifted °°Co lines suggest large-scale asym- 
metry in the explosion. 

There has been growing evidence for asym- 
metries in supernovae explosions over the past 
decades (25). In SN1987A itself, asymmetry was 
initially supported by extensive evidence for 
mixing and polarized optical emission as re- 
viewed in (26, 27), and later by spatially resolved 
images of the ejecta (27, 28). NuSTAR observa- 
tions of the spatial distribution of “Ti in the Cas 
A supernova remnant shows direct evidence of 
asymmetry (29). Our results here suggest an even 
higher level of asymmetry for SN1987A. For com- 
parison, NuSTAR measured a redshift for the in- 
tegrated Cas A spectrum of (2100 + 900) km s* 
and a line broadening corresponding to a fastest 
ejection velocity of ~5000 km s”. Given that ejec- 
tion velocity and the age of the remnant (340 years), 
the estimated “look-back” redshift velocity for 
Cas A is ~1400 km s™, consistent with the mea- 
sured redshift. From the spatially integrated “Ti 
spectrum alone, Cas A would not appear to have 
a Statistically significant asymmetry: The spatial 
brightness distribution in Cas A revealed the 
asymmetries. 

In the “‘Ti-powered phase, the dominant en- 
ergy input to the ejecta comes through the sub- 
sequent positron emission of “Sc, when most 
of the gamma rays escape the ejecta without 
interacting. These positrons are produced deep 
in the ejecta, and both simple estimates and de- 
tailed models suggest that they are locally ab- 
sorbed and instantaneously thermalized (20, 22). 
The implication is that the UVOIR emission of 
SN1987A during the “Ti-powered phase should 
be dominated by the ejecta spatially coinci- 
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dent with the “Ti ejecta. In principle, UVOIR 
spectral imaging in the “‘Ti-powered phase 
can yield direct evidence for asymmetries. Hub- 
ble Space Telescope (HST) obtained resolved 
spectral images of the SN1987A ejecta (28) 
from June 2000 (4857 days after explosion), near 
the end of the phase when the UVOIR emis- 
sion was truly dominated by “*Ti decay (20). 
They reveal a bipolar structure elongated along 
the north-south direction. There is a clear gra- 
dient in velocity across ejecta, with the north- 
ern component showing a redshift of about 
500 km s‘? in the [Ca II] 47300 emission line, 
whereas the southern component showed a 
larger redshift of about 1700 km s”'. The ejecta 
exhibit an overall redshift of ~1000 km s™‘. At 
the time, this asymmetry and overall redshift 
were noted but not emphasized, as they could 
be the result of blending of the [Ca IT] 47300 
line with a [O II] 47320 line. This shifted vel- 
ocity distribution is consistent with our mea- 
sured redshift of the “*Ti lines. On the basis of 
our “Ti line profile, we might naively imagine 
the picture of a bright, redshifted clump or jet 
of “Ti, with the UVOIR emission tracing the 
spatial and velocity distribution of this clump. 
However, the spatial distribution of the ejecta 
in this HST observation does not immediately 
reveal such a large spatial asymmetry. 

A single-lobe (i.e., very asymmetric) explosion 
model for SN1987A (30) could explain the ob- 
served evidence that °°Ni was mixed to speeds 
exceeding 3000 km s” and redshifted, as evi- 
denced by both the gamma-ray emission and the 
infrared forbidden line profiles of [Fe IT] (mainly 
produced through *°Ni decay) around 400 days 
after the explosion (37, 32). In this model, the 
single lobe is oriented at an angle pointing away 
from us, producing the redshifted lines (30). The 
NuSTAR observations appear consistent with these 
single-lobe models. One consequence of such a 
highly asymmetric explosion is that the compact 
object produced by SN1987A would, presumably, 
receive a kick velocity opposite the direction of 
the ejecta (33). 
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PHYSICS 


Creating and probing electron 
whispering-gallery modes 


in graphene 


Yue Zhao,”?* Jonathan Wyrick,’* Fabian D. Natterer,’* Joaquin F. Rodriguez-Nieva,** 
Cyprian Lewandowski,* Kenji Watanabe,” Takashi Taniguchi,” Leonid S. Levitov,” 


Nikolai B. Zhitenev,'+ Joseph A. Stroscio't+ 


The design of high-finesse resonant cavities for electronic waves faces challenges due to 
short electron coherence lengths in solids. Complementing previous approaches to confine 
electronic waves by carefully positioned adatoms at clean metallic surfaces, we 
demonstrate an approach inspired by the peculiar acoustic phenomena in whispering 
galleries. Taking advantage of graphene’s gate-tunable light-like carriers, we create 
whispering-gallery mode (WGM) resonators defined by circular pn junctions, induced by a 
scanning tunneling probe. We can tune the resonator size and the carrier concentration 
under the probe in a back-gated graphene device over a wide range. The WGM-type 
confinement and associated resonances are a new addition to the quantum electron-optics 
toolbox, paving the way to develop electronic lenses and resonators. 


harge carriers in graphene exhibit light- 
like dispersion resembling that of electro- 
magnetic waves. Similar to photons, electrons 
in graphene nanostructures propagate bal- 
listically over micrometer distances, with 
the ballistic regime persisting up to room tem- 
peratures (7). This makes graphene an appeal- 
ing platform for developing quantum electron 
optics, which aims at controlling electron waves in 
a fully coherent fashion. In particular, gate-tunable 
heterojunctions in graphene can be exploited to 


manipulate electron refraction and transmission 
in the same way that optical interfaces in mirrors 
and lenses are used to manipulate light (2). These 
properties have stimulated ideas in optics-inspired 
graphene electronics. First came Fabry-Pérot in- 
terferometers (3), which have been fabricated 
in planar npn heterostructures in single-layer 
graphene (4) and subsequently in bilayer (5) 
and trilayer graphene (6). The sharpness of the 
pn junctions achievable in graphene can enable 
precise focusing of electronic rays across the junc- 


tion, allowing for electronic lensing and hyper- 
lensing (7-9). 

We report on electron whispering-gallery mode 
(WGM) resonators, an addition to the electron- 
optics toolbox. The WGM resonances are familiar 
for classical wave fields confined in an enclosed 
geometry—as happens, famously, in the whisper- 
ing gallery of St. Paul’s Cathedral. The WGM res- 
onators for electromagnetic fields are widely used 
in a vast array of applications requiring high- 
finesse optical cavities (10-12). Optical WGM 
resonators do not depend on movable mirrors 
and thus lend themselves well to designs with 
a high quality factor. This can render the WGM 
design advantageous over the Fabry-Pérot design, 
despite challenges in achieving tunability due to 
their monolithic (single-piece) character [see (12) 
for a mechanically tunable optical WGM resona- 
tor]. Our system is free from these limitations, 
representing a fully tunable WGM resonator in 
which the cavity radius can be varied over a wide 
range by adjusting gate potentials. In contrast, the 
best electronic resonators known to date—the 
nanometer-sized quantum corrals designed by 
carefully positioning adatoms atop a clean me- 
tallic surface (13)—are not easily reconfigurable. 
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Fig. 1. Confined electronic states in microscopic electron cavities 
defined by pn junction rings in graphene. (A) The rings are induced by the 
STM tip voltage bias (V,) and back-gate voltage (V,), adjusted so as to reverse 
the carrier polarity beneath the tip relative to the ambient polarity. The pn 
junctions act as sharp boundaries giving rise to Klein scattering of electronic 
waves, producing mode confinement via the whispering-gallery mechanism. 
The cavity radius and the local carrier density are independently tunable by the 
voltages Vp and V,. Electron resonances are mapped out by the STM spec- 
troscopy measurements (see Fig. 2). Shown are the STM tip potential U(r) and 
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the quantities discussed in the text: the STM tip radius (R), its distance from 
graphene (d), and the local (9) and ambient (u...) Fermi levels with respect to 
the Dirac point. n and p label the electron and hole regions. (B) Spatial profile of 
WGM resonances. Confinement results from interference of the incident and 
reflected waves at the pn rings (dashed lines). The confinement is stronger for 
the larger angular momentum m values, corresponding to more oblique wave 
incidence angles. This is illustrated for m = 5 (weak confinement) and m = 13 
(strong confinement). Plotted is the quantity Re(wag), the real part of the 
second spinor component in Eq. 1. 
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Further, although WGM resonators are ubiq- 
uitous in optics and acoustics (10-12, 14), only a 
few realizations of such resonators were obtained 
in nonoptical and nonacoustic systems. These 
include WGM for neutrons (15), as well as for 
electrons in organic molecules (J6). In our measure- 
ments a circular electron cavity is created be- 
neath the tip of a scanning tunneling microscope 
(STM), and we directly observe the WGM-type 
confinement of electronic modes. The cavity is de- 
fined by a tip-induced circular pn junction ring, 
at which the reflection and refraction of electron 


waves are governed by Klein scattering (Fig. 1). 
Klein scattering originates from graphene’s lin- 
ear energy dispersion and opposite group veloc- 
ities for conduction and valence band carriers; 
Klein scattering at a pn junction features a strong 
angular dependence with a 100% probability for 
transmission at normal incidence, as well as fo- 
cusing properties resembling negative refractive 
index metamaterials (2, 7). Although Klein scattering 
is characterized by perfect transmission and no 
reflection for normal incidence, it gives rise to 
nearly perfect reflection for oblique incidence 


di/dV, (nS) se. 8 


di/dV,, (arb. units) 


Fig. 2. Confined electronic states probed by STM measurements. (A) Differential tunneling 
conductance (d//dV,) for a single-layer graphene device, as a function of sample bias (V,) and back- 
gate voltage (V,). The gate map was obtained after increasing the probe-tip work function by exposure to 
deuterium to shift the interference fringes vertically downward (fig. S5) (17). The two fans of interference 
features, marked WGM’ and WGM", originate from WGM resonances in the DOS (see text). (B) 
Interference features in dl/dV,p, calculated from the relativistic Dirac model. The features WGM' and 
WGM" in the (Vg,V_) map originate, respectively, from the conditions € = lo and € = Uo + eV (See text). The 
boundaries of the WGM' (and WGM") regions are marked by dashed (and dotted) white lines, respectively. 
arb. units, arbitrary units. (©) d//dV, spectra taken along the horizontal line in (A) at V, = 230 mV. (D) 
dl/dV,, spectra taken along the two vertical lines in the map in (A) at V, = 16 V (red line) and V, = —11 V (blue 
line, scaled x3 and offset for clarity) (see text for discussion). The four peaks at positive bias at Vz = 16 Vare 
fit to Gaussian functions, with the fits shown in the lower right of the figure. The peaks labeled 1”,2”,3”... 
correspond to WGM resonances probed at the energy € = Lio + eV, whereas the peaks labeled 1’,2’,3’..., are 
the same WGM resonances probed at the Fermi level € = po, giving rise to the WGM” and WGM' fringes in the 
gate maps, respectively. The resonance spacing of order 40 mV translates into a cavity radius of 50 nm, 
using the relation Ae = nive/r (see text). 
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occurring in the WGM regime (2). As illustrated 
in Fig. 1B, this yields excellent confinement and 
high-finesse WGM resonances for modes with 
high angular momentum m and a less perfect 
confinement for non-WGM modes with lower 
m values. 

Electron optical effects in graphene have so far 
been explored using transport techniques, which 
lack spatial and angular resolution that would 
be indispensable for studying confined electronic 
states and/or electron lensing. Our scanning probe 
technique allows us to achieve nanometer-scale 
spatial resolution. The STM probe has a dual 
purpose: (i) creating a local pn junction ring, 
which serves as a confining potential for electronic 
states, and (ii) probing by electron tunneling the 
resonance states localized in this potential. The 
planar back gate and the STM tip, acting as a 
circular micrometer-sized top gate, can change 
both the overall background carrier density and 
the local carrier density under the tip. As such, 
pn and np circular junctions centered under 
the probe tip (Fig. 1A) can be tuned by means of 
the tip-sample bias ;, and the back-gate voltage 
Vz [see fig. S4 (17)]. For the purpose of creating 
resonant electronic modes inside the junction, 
this configuration gives us in situ, independent 
control over the carrier concentration beneath 
the STM tip and the pn ring radius. The tunnel- 
ing spectral maps from such a device show a 
series of interference fringes as a function of 
the knobs (V;,,V,) (Fig. 2). These fringes origi- 
nate from resonant quasi-bound states inside 
the pn ring. 

The measured spacing between fringes (Ae) 
can be used to infer the cavity radius (7). Using 
the formula Ae = rhvy/r (fi, Planck’s constant h 
divided by 2x; vp =~ 10° m/s) and an estimate 
from Fig. 2A (Ae ~ 40 meV), we obtain 7 ~ 50 nm, 
a value considerably smaller than the STM tip 
radius (R = 1 um). This behavior can be under- 
stood from a simple electrostatic model of a charged 
sphere proximal to the ground plane. When the 
sphere-to-plane distance d is small compared with 
the sphere radius R, the induced image charge den- 
sity cloud p(r) behaves as p(r) <1/(d + r?/2R), 
predicting a length scale 2Rd « R. This crude 
estimate is upheld, within an order of magni- 
tude, by a more refined electrostatic modeling 
(17), which also gives a length scale much smaller 
than R. 

The experimental results were obtained on a 
device consisting of a graphene layer on top of 
hexagonal boron nitride, stacked on SiO, with a 
doped Si back gate [see supplementary materials 
for details (17)]. Figure 2A shows a tunneling 
conductance map as a function of back-gate volt- 
age (V,) on the horizontal axis and sample bias 
(V,) on the vertical axis. A series of interference- 
like fringes forming a curved fan (labeled WGM’) 
can be seen in the upper right of Fig. 2A. The 
center of the fan defines the charge neutrality 
point. This point can be off (0,0) in the (V,,V;,) 
plane due to impurity doping of graphene (shift 
along V,) and the contact potential difference 
between the probe tip and graphene (shift in V,). 
As illustrated in fig. S5 (17), we are able to shift 
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the center point of the fan to lower Vj, values by 
changing the tip work function, for example, 
with D, adsorption (78). Another interesting fea- 
ture in such conductance maps is a (some- 
what less visible) second fan of fringes (labeled 
WGM"), which is crossing the primary WGM’ 
fan. The fringes in the WGM” fan follow the 
typical graphene dispersion with respect to the 
Fermi energy, which varies with doping as & \/|Vz| 
from higher sample bias to lower as a function 
of V,. Examining the primary (WGM’) and sec- 
ondary (WGM") fringes more closely confirms 
that they originate from the same family of WGM 
resonances. 

Figure 2C shows nine oscillations in a line cut 
across the WGM' fan along the V, axis. To un- 
derstand the origin of these oscillations, we ex- 
amine the two spectral line cuts along the V, axis 
in Fig. 2D. The first spectrum in Fig. 2D at V, = 
-11 V (blue curve) contains a group of resonances 
(abeled 1” to 3”) near the Fermi level (V;, = 0) 
with a spacing of 37.6 + 1.2 mV (J9). In the map 
in Fig. 2A, these resonances can be seen to move 
to lower energies approximately following the 
typical Dirac point dispersion ~ \/|V,|. Taking 
a vertical cut at a higher back-gate voltage of V, = 
16 V (red curve) shows resonances 1” and 2” 
shifted down in energy in Fig. 2D. Focusing now 
at slightly higher energies, the WGM' resonances 
appear at positive energies in Fig. 2D and are 
labeled 1’ to 4’ for V, = 16 V. These four reso- 
nances are fit to Gaussian functions and shown 
deconvolved from the background conductance 
in the bottom right of the figure. The average 
spacing of these resonances is 116.9 + 7.5 mV (19). 
A close examination of Fig. 2A indicates the one- 
to-one correspondence between the WGM" reso- 
nances 1”, 2”,... and the WGM’ resonances 1’, 2’..., 
suggesting their common origin. We therefore 
conclude that the WGM” resonances corre- 
spond to tunneling into the pn junction modes 
at energy € = Ly + eVy [to, local Fermi level (see 
Fig. 1A)], whereas the WGM’ resonances reflect 
the action of the STM tip as a top gate, allowing 
tunneling into the same resonance mode at € = 
Ho [see fig. S3 (17)]. For example, resonance 1” 
seen at V, ~ -100 mV is now accessible at the 
Fermi level by the tip-graphene potential differ- 
ence, as shown in fig. S3D (17), when tunneling 
into the WGM' resonance 1’ at Vi, = 82 mV in 
Fig. 2A. 

To clarify the WGM character of these reso- 
nances, we analyze graphene’s Dirac carriers in 
the presence of a potential induced by the STM 
tip described by the Hamiltonian H = Hy + U(r), 
where Hp is the kinetic energy term and Ur) 
describes the STM tip potential seen by charge 
carriers. Because relevant length scales—the elec- 
tron’s Fermi wavelength and the pn ring radius—are 
much greater than the atomic spacing, we fo- 
cus on the low-energy states. We linearize the 
graphene electron spectrum near the K and K’ 
points, bringing Hp to the massless Dirac form: 
ew(r) = [ogo - p+ U(r)|(r), where p = -ihV, 
and o = (0x, oy) are pseudospin Pauli matrices. 
We take the tip potential to be radially sym- 
metric, reflecting the STM tip geometry. Fur- 
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thermore, the distance from the tip to graphene 
(d) is considerably smaller than the electron’s 
Fermi wavelength and the pn ring radius, both of 
which are smaller than the STM tip radius. We 
can therefore use a parabola to approximate the 
tip potential, U(r) = xr? (7, off-center displace- 
ment). The curvature «, which affects the energy 
spectrum of WGM resonances, can be tuned with 
the bias and gate potentials, as discussed in the 
supplementary materials (17). 

The WGM states with different angular mo- 
mentum can be described by the polar decom- 
position ansatz 


1 i(m-1)o 
W(t, >) = We ours ) (1) 


where ™ is an integer angular momentum quan- 
tum number, is the polar angle, and A, B label 
the two graphene sublattices. We nondimension- 
alize the Schrédinger equation using the charac- 
teristic length and energy scales (rx = Rd, 
ex = hog/ /Rd) to obtain the radial eigenvalue 
equation of the two-component spinor w(7) with 
components w,(7°) and wp(7) 


Me wr?) u(r) (2) 


eu(r) = (io. | 


Here r is in units of 7x, € is in units of ex, and 
« is in units of kx = ex/r%. This equation is 
solved using a finite difference method [see 
supplementary material (17)]. We can use this 
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Fig. 3. Contributions of the WGM resonances with different m to the DOS for the relativistic Dirac 
model. (A) Colored curves represent partial-m contributions from angular momentum values m = 1,2,3,4,5 
(see Eq. 3), evaluated for a confining potential U(r) = «r2 with curvature value x = x, = e,,/Rd. Different 
curves show the partial DOS contributions defined in Eq. 3, which are offset vertically for clarity. The inset 
shows the total DOS versus particle energy « and the curvature « (see text). The black curve shows the 
total DOS trace along the white line. (B) The Dirac wavefunction for different WGM states (see Eq. 1). 
Spatial structure is shown for several resonances in the partial DOS (black dashed circles mark the pn 
junction rings). The quantity plotted, Re(w,), is the same as in Fig. 1B. The length scale rx = VRd (the same 
in all panels) is marked. Note the confinement strength increasing with m. 
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microscopic framework to predict the mea- 
sured spectral features. The tunneling current, 
expressed through the local density of states 
(DOS), is modeled as 


Mo +eVp, 
r=). 'deT(e,Vs)EDnl(e), 
m 


Dm(e) =) e*"? |uy(r)|?5(e - ey) (3) 


which is valid for modest Vj, values (20). Here to 
is the Fermi energy under the tip, which in gen- 
eral is different from ambient Fermi energy bt. as 
a result of gating by the tip (see below). The 
transmission function 7T(e,V;,), which depends on 
the tip geometry, work function, and DOS, will 
be taken as energy-independent. The quantity 
D(€) = YmDm(e) represents the sum of partial-m 
contributions to the total DOS beneath the tip, 
with v labeling eigenstates of Eq. 2 with fixed m. 
The weighting factor e~*”’/? is introduced to ac- 
count for the finite size of the region where 
tunneling occurs, with the Gaussian halfwidth 
x-1/2 ~ r, [see supplementary materials (17)]. 

The WGM resonances for different partial-m 
contributions D,,,(€), which combine into the total 
DOS (Fig. 3), reveal that individual WGM states 
exhibit very different behavior depending on the 
m value [see Figs. 1B and 3B]. Klein scattering at 
the circular pn junction produces confinement cre- 
ating the WGMs, and the confinement is stronger 
for the large-m modes and weaker for small-m 
modes. The Klein reflection probability is strongly 
dependent on the angle of incidence 0 at the pn 
interface, growing as R(@)~1 — exp[-Esin?(6)], 
where & is a characteristic dimensionless param- 
eter (27). The value of 8 grows with m as tan 8° 7. 
As a result, larger values of ™ must translate into 
larger reflectivity and stronger confinement. This 
trend is clearly demonstrated in Figs. 1B and 3B. 
Also, as m increases, mode wavefunctions are 
being pushed away from the origin, becoming 
more localized near the pn ring, in full accord 
with the WGM physics. 

To understand how one family of WGM res- 
onances gives rise to two distinct fans of inter- 
ference features seen in the data, we must account 
carefully for the gating effect of the STM tip. We 
start with recalling that conventional STM spec- 
troscopy probes features at energies €) = Hp + eVp, 
where €,, are system energy levels. This corresponds 
to the family WGM” in our measurements. How- 
ever, as discussed above, the tip bias variation 
causes the Fermi level beneath the tip to move 
through system energy levels €,, producing an 
additional family of interference features (WGM") 
described by €n = Ug [see fig. S3 and accompany- 
ing discussion (17)]. To model this effect, we 
evaluate the differential conductance G = dI/dV, 
from Eq. 3, taking into account the dependence 
Uo versus Vi. This gives 


Go(1- )D(to + eVb) + ND(Uo) (4) 


with n = -Ot)/6(eV;,). The two contributions in 
Eq. 4 describe the WGM’ and WGM" families. 
We note that the second family originates from 
the small electron compressibility in graphene, 
resulting in a finite n and would not show up ina 
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system with a vanishingly small n (e.g., in a me- 
tal). We use Eq. 4 with a value n = 1/2 to generate 
Fig. 2B. In doing so, we use electrostatic mod- 
eling [described in (17)] to relate the parameters 
(e,«) in the Hamiltonian, Eq. 2, and the experi- 
mental knobs (V;, V,). This procedure yields a 
very good agreement with the measured d//dV;, 
(Fig. 2, A and B). 

In addition to explaining how the two sets of 
fringes, WGM’' and WGM", originate from the 
same family of WGM resonances, our model ac- 
counts for other key features in the data. In 
particular, it explains the large difference in the 
WGM' and WGM" periodicities noted above. It 
also correctly predicts the regions where fringes 
occur (Fig. 2B). The bipolar regime in which pn 
junction rings and resonances in the DOS occur 
(see fig. S6 and S7) takes place for the probed 
energies € of the same sign as the potential curva- 
ture. In the case of a parabola U(r) = xr’, this 
gives the condition ex > 0, corresponding to the 
upper-right and lower-left quadrants in Fig. 3A, 
inset. However, under experimental conditions, the 
potential is bounded by U(|r|—> ©) = tg — La (see 
Fig. 1A), which constraints the regions in which 
WGMs are observed (17). Accounting for the finite 
value U(|r|— ©) yields the condition |e|<|u9-p.|, 
with sgn(e) = sgn(x) = sgn(Uo - Hw). This gives 
the WGM’ and WGM" regions in Fig. 2B bounded 
by white dashed and white dotted lines, respec- 
tively, and matching accurately the WGM’ and 
WGM" location in our measurements. 

The range of m values that our measurement 
can probe depends on the specifics of the tun- 
neling region at the STM tip. We believe that a 
wide range of m values can be accessed; however, 
we are currently unable to distinguish different 
partial-m contributions, because the correspond- 
ing resonances are well aligned (Fig. 3). Different 
m states may contribute if the tunneling center is 
not the same as the geometric center of the tip, 
which is highly likely. As shown in (73), higher m 
states can be accessed by going off center by as 
little as 1 nm, which is likely in our real experi- 
ment due to a residual asymmetry of the STM tip 
[we model this effect by a Gaussian factor in Eq. 
3]. We note in this regard that different angular 
momentum 7m values translate into different or- 
bital magnetic moment values, opening an op- 
portunity to probe states with different m by 
applying a magnetic field. 

The explanation of the observed resonances in 
terms of the whispering-gallery effect in circular 
pn rings acting as tunable electronic WGM res- 
onators has other notable ramifications. First, it 
can shed light on puzzling observations of reso- 
nances in previous STM measurements (22-24), 
which hitherto remained unaddressed. Second, 
our highly tunable setup in which the electron 
wavelength and cavity radius are controlled in- 
dependently lends itself well to directly probing 
other fundamental electron-optical phenomena, 
such as negative refractive index for electron waves, 
Veselago lensing (7), and Klein tunneling (2). Fur- 
ther, we envision probing more exotic phenomena 
such as the development of caustics, where an inci- 
dent plane wave is focused at a cusp (25-27), and 


special bound states for integrable classes of dy- 
namics, where the electron path never approaches 
the confining boundary at perpendicular inci- 
dence (28). These advances will be enabled by 
the distinct characteristics of graphene that allow 
for electronic states to be manipulated at the 
microscale with unprecedented precision and 
tunability, thus opening a wide vista of graphene- 
based quantum electron-optics. 
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Migrating tremor off southern 
Kyushu as evidence for slow slip of a 
shallow subduction interface 


Y. Yamashita,'*+ H. Yakiwara,” Y. Asano,” H. Shimizu,’ K. Uchida,’ S. Hirano,” 
K. Umakoshi,* H. Miyamachi,” M. Nakamoto,’ M. Fukui,’ M. Kamizono,’ H. Kanehara,” 
T. Yamada,° M. Shinohara,® K. Obara® 


Detection of shallow slow earthquakes offers insight into the near-trench part of the 
subduction interface, an important region in the development of great earthquake ruptures 
and tsunami generation. Ocean-bottom monitoring of offshore seismicity off southern 
Kyushu, Japan, recorded a complete episode of low-frequency tremor, lasting for 1 month, 
that was associated with very-low-frequency earthquake (VLFE) activity in the shallow 
plate interface. The shallow tremor episode exhibited two migration modes reminiscent of 
deep tremor down-dip of the seismogenic zone in some other subduction zones: a large-scale 
slower propagation mode and a rapid reversal mode. These similarities in migration 
properties and the association with VLFEs strongly suggest that both the shallow and deep 
tremor and VLFE may be triggered by the migration of episodic slow slip events. 


stable sliding zone. Episodic tremor and slip (ETS) 
is the coupled phenomena of nonvolcanic deep 
low-frequency tremor (7) and slow slip events 


low earthquakes are a range of seismic phe- 
nomena with longer durations than ordi- 
nary earthquakes. They are considered key 


to understanding the generation of mega- 
thrust earthquakes because their activity 
affects stress accumulation on the locked zone. 
Slow earthquakes are well documented in the deep 
transition zone between the seismogenic zone and 


(SSEs) (2) with a specific recurrence interval. Ob- 
servations of ETS have been documented in 
Cascadia (3) and southwestern Japan (4). ETS 
events also coincide with deep very-low-frequency 
earthquake (VLFE) activity (5). These events share 
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Fig. 1. Shallow low-frequency tremor distribution and tec- 135°E 
tonic setting. Location map (inset) and bathymetry of the 
study area, showing spatiotemporal distribution of shallow trem- 
or (colored circles) [time zone is Japan standard time (JST), or 
universal time coordinated + 09:00], locations and numbers of 
OBS stations (yellow squares), epicenters of small repeating 
earthquakes (orange circles) (16), the coseismic slip area of M7- 
class interplate earthquakes (dark gray) (26, 27), and the outer 
edge of the subducted KPR (dark green bold) (24). The con- 
vergence direction of the Philippine Sea Plate (30) is denoted by 
the red arrow, the trench axis is indicated by the dashed blue line, 
and bathymetric contours are shown at 100-m intervals. KUSM 
is a land-based seismic station, which used in Fig. 2A. 
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acommon mechanism as shear slip on the plate 
interface (6). Because the spatiotemporal evolu- 
tion of deep tremor and VLFEs reflects the rup- 
ture process of SSEs (7), tremor activity is useful 
for assessing interplate aseismic slip in the deep 
transition zone. Many observations of deep trem- 
or suggest that SSEs are common on the deep 
side of the seismogenic zone. 

The shallow subducting plate interface near 
the trench was thought to be a conditionally sta- 
ble zone (8). Understanding the slip potential of 
this region is important for evaluating the devel- 
opment of great earthquake ruptures, such as the 
2011 Tohoku-Oki earthquake, and the associated 
potential for tsunami generation. Therefore, the 
nature of slow earthquakes (particularly SSEs, 
which account for the greatest moment release 
of slow earthquakes) has critical implications for 
the interplate slip and moment budget of the 
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shallow plate interface. Shallow VLFE activity has 
been observed in Japan (9-17) and Costa Rica (12), 
but low-frequency tremor observations near the 
trench are scarce (13). The previous studies have 
not indicated spatiotemporal variation of activities 
for shallow tremor and VLFEs. Although SSEs 
have recently been detected at both deep and 
shallow plate interface in southwest Japan by 
Global Navigation Satellite System (GNSS) anal- 
ysis (14), the spatiotemporal relationship with seis- 
mic shallow slow events is still unclear. 

The region east off the east coast of southern 
Kyushu (the western part of Nankai Trough) is 
one of the most active areas of ordinary earth- 
quake seismicity (fig. S1) and shallow VLFE seis- 
micity in southwestern Japan (9, 10). Estimation of 
interplate coupling from long-term ground defor- 
mation using GNSS data (15) and the existence of 
small repeating earthquakes (16) suggest that the 
subduction megathrust of this area is creeping 
(ow interplate coupling) while producing small 
and moderate-sized interplate earthquakes. The 
locked Nankai subduction zone to the northeast 


A Ocean bottom seismogram (H1+H2: 2-8Hz RMS envelope) 


provides a sharp contrast to this region. We ex- 
pected a phenomenon like ETS to occur in the 
shallow plate interface because shallow VLFE seis- 
micity is seen repeatedly (fig. S2), although no 
tremor has been previously reported in the area. 
We deployed 12 short-period ocean bottom seis- 
mometers (OBSs) for ~80 days to investigate shal- 
low tremor activity of this area (Fig. 1) (17). 
Fortuitously, the first major episode of shallow 
VLFE activity in 3 years started just after we de- 
ployed the OBSs. This VLFE episode occurred in 
the same area as previous activity, according to 
array analysis of data from land-based broadband 
seismic stations (10) (fig. S2). The episode lasted 
for 1 month and included three active stages (Fig. 
2C). During the same time period, our OBS network 
recorded a complete, month-long episode of shallow 
tremor activity. The dominant frequency of the 
tremor waveforms, ranging from 0.5 to 8 Hz, was 
lower than that of ordinary earthquakes (Fig. 3) 
(17). Each peak of tremor in the OBS observations 
almost corresponded to each wave train recorded 
by land-based broadband seismographs, after al- 
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Fig. 2. Comparison with shallow tremor and VLFE activity. (A) Example of shallow tremor in 1-hour 
continuous root mean square (RMS) envelopes from the OBS network and land-based station KUSM (locations in 
Fig. 1). Envelope traces were processed with a 2- to 8-Hz band-pass filter and smoothed with a 5-s moving window. 
Note that the gain of the trace from KUSM is 20 times as high as that of the OBS stations. The event at 08:46 is an 
ordinary small earthquake near OBS station 11. (B) Example of shallow VLFE in 1-hour continuous waveforms from 
F-net land-based broadband stations operated by the National Research Institute for Earth Science and Disaster 
Prevention. Waveform traces were processed with a 10- to 50-s band-pass filter. UD, up-down (vertical) com- 
ponent. (C) Comparison of spatial (left) and temporal (right) distribution of shallow tremor (red) and shallow 
VLFEs (gray). Note that the shallow tremor and VLFEs were detected every 1 min and 15 s, respectively. 
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lowing for the difference of ~1 min in travel time 
between the two sets of stations (Fig. 2, A and B). 

The shallow tremor episode started on 28 May 
2013, had major activity on 10 to 14 June and 16 
to 21 June, and was essentially finished on 27 June 
2013 (Fig. 2C). The time sequence of the tremor 
episode is similar to that of VLFEs. The source 
locations of the shallow tremor were estimated 
using the envelope correlation method (7), assum- 
ing an S-wave velocity of 3.5 km/s (17). The en- 
velope functions are from combining the horizontal 
channels, and the tremor locations come from 
2-min windows. Horizontal locations were con- 
strained to less than +5 km within the OBS net- 
work (fig. $5), and they roughly overlapped the 
VLFE focal area (Fig. 2) (17). Although this meth- 
od has poor depth resolution, well-constrained 
shallow tremors were concentrated at an estimated 
depth of ~15 km (fig. S6), consistent with the depth 
of the plate interface in this region. The shallow 
tremor was distributed in a narrow belt parallel 
to the trench (Fig. 1), matching the up-dip limit 
of interplate small repeating earthquakes in this 
region (16), which reflects the up-dip limit of the 
zone where the ordinary interplate earthquakes 
occur. Background seismicity is also nonactive in 
the focal region of shallow tremor (fig. $1). Thus, 
the shallow tremor and ordinary interplate earth- 
quakes appear to be spatially distinct. 

We detected two distinct sequences of migrating 
tremor events, separated by an aseismic period 
lasting a few days. In the first sequence, events began 
off the east coast of Tanegashima Island in early 
June 2013. They then migrated generally north- 
ward with a slight change to the north-northwest 
around OBS station 8, reaching the area of OBS sta- 
tions 6 and 7 on 12 to 14 June (fig. $7). In the second 
sequence, events resumed south of OBS station 8 
on 16 June. They migrated north-northwestward 
to the vicinity of OBS station 7, then turned sharp- 
ly east to the area around OBS station 9 (fig. S8). 
Compared with the location of subducted Kyushu- 
Palau Ridge (KPR), the first migration stopped 
within the ridge but the second migration over- 
rode and reached the outside of the ridge’s edge. 

In the first and second sequences, tremor mi- 
grated over a length of 90 km in 3 days and 1.5 days, 
respectively (Fig. 4). The tremor generally propa- 
gated along strike (before turning up-dip on June 
18) at speeds of approximately 30 to 60 km/day, 
similar to or somewhat higher than the deep 
tremor at Nankai and Cascadia (18, 19). The mi- 
gration front of first sequence typically shows a 
piecewise linear pattern, as in the deep ETS event 
(20). In addition, the very short duration in which 
tremor migrated in the reverse direction at speeds 
of 100 to 200 km/day occurred during the later 
part and after the propagation of first migration 
sequence (Fig. 4). This fast and backward migration 
is, to some extent, similar to the rapid tremor re- 
versal (RTR) observed in deep ETS activity (8, 19). 

The migration property of shallow tremor and 
its activity associated with VLFEs basically resembles 
the pattern of deep tremor during ETS events. 
Some theoretical studies have explained this 
tremor migration as the successive failure of small 
fault patches within the brittle-ductile transition 
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zone, as SSEs propagating on the plate interface 
increase the stress around zones of high slip rate 
(21, 22). Therefore, the migration of shallow tre- 
mor may typify episodic SSEs in the near-trench 
shallow plate interface, analogous to the coupled 
phenomena of deep tremor, VLFEs, and episodic 
SSEs at the down-dip side of the locked zone (7). 
Although this study did not detect shallow SSEs 
associated with tremor and/or VLFE activity through 
geodetic evidence, the migration of shallow tremor 
is consistent with the migrating rupture front of 
SSE. These findings suggest that the both deeper 
and shallower sides of a subduction plate inter- 
face have similar frictional properties. The cou- 
pled phenomena of tremor, VLFEs, and SSEs 
probably occur in both plate interfaces. 

The shallow tremor that we documented has 
some features in common with deep tremors, 
which suggests that there is a common genera- 
tion process between shallow and deep tremors, 
as expected from processes like dehydration (J). 
The undulating surface structure of the KPR may 
play an important role in effective transportation 
of the clay minerals and fluid at depth and may 
thus be the key to the generation of shallow trem- 
or in this area. On the other hand, there are some 
differences between both tremors in the details. 
The migration fronts of the two sequences of 
shallow tremor moved slightly faster than deep 
tremor sequences, which migrate at speeds rang- 
ing from a few to a few tens of kilometers per day 
(18, 19). The shallow tremor migrated over al- 
most the same area after a very short interval 
(2 days), and the second migration was faster 
than the first migration (fig. S9). We lack reports 
of large-scale and along-strike multiple migrations 
during a deep tremor episode, although RTR and 
very fast tremor streaks that migrate along the dip 
direction at 30 to 200 km/hour (23) are known to 
occur repeatedly, even within an episode. There- 
fore, the properties of large-scale and along-strike 
migration of shallow tremor are similar but not 
identical to those of deep tremor. The rapid reverse 
propagation is also not the same as the deep RTR 
reported for the Cascadia deep tremor that oc- 
curs during the linear propagation of the overall 
tremor front (19). Rheological differences between 
shallow and deep tremor zones may explain the 
difference in the behavior between both tremors. 

The subducted KPR is thought to be a seg- 
ment boundary of coseismic slip (24). Meanwhile, 
the migration episodes of tremor propagated ob- 
liquely across the KPR, which suggests that this 
ridge did not work as a strong segment bound- 
ary for the slow rupture. This behavior is consist- 
ent with the argument that subducted seamounts 
provide favorable conditions for aseismic creep and 
small earthquakes but unfavorable conditions for 
generation and propagation of large ruptures (25). 

Shallow tremor stopped migrating further in 
the northward direction and turned sharply east- 
ward around OBS station 7. This area is located in 
the up-dip side where large [moment magnitude 
(M,,) 6.5 to 7.5] megathrust events occur (26, 27). 
Therefore, the locking or creeping state of deeper 
megathrust changed along-strike, and the migra- 
tion to the northward direction was blocked by 
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the locked zone. In addition, the migration did 
not finally reach to the up-dip side of the rupture 
area of the M,, 7.5 megathrust event. These ob- 
servations suggest that the migration of shallow 
tremor occurred in a plate interface, which is the 
shallower extension of the creeping megathrust. 
In other words, the shallow tremor migration 
may be a response to the creeping of the deeper 
megathrust. Whether it can also occur up-dip 
of a locked megathrust such as at Nankai and 
Cascadia awaits future observations. 


As the occurrence of the megathrust event 
approaches, the state of interplate coupling will 
become progressively weakened from the surround- 
ing area, even despite the locked megathrust. As 
a result, a change of spatiotemporal activity of 
shallow slow earthquakes is expected in the up- 
dip side of the locked zone, similar to the case for 
deep slow earthquakes, as predicted by some 
computer simulation studies (28). 

The worst-case scenario is for shallow slow 
earthquakes to precede a large seismic rupture, 
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Fig. 3. Characteristics of shallow tremor in time and frequency domain. (A) Example waveforms of 
shallow tremor recorded at OBS stations. A vertical component waveform (red) is superimposed on a 
horizontal component (black). Each trace is normalized by the maximum amplitude (Amp.) of a horizontal 
component. More example waveforms are shown in fig. S3. (B) Example of power spectra of shallow tremor, 
ordinary earthquake, and background noise observed at OBS station 8. The ordinary earthquake spectrum is 
calculated from the S-wave record and its coda using a horizontal channel. The selected event occurred within 
several kilometers of the tremor and had approximately the same amplitude. The time-window length is 10.24 s, 
starting 2 s before the S-wave arrival. The tremor spectrum and background noise are stacked and from the 
same horizontal channel, using a 10.24-s window over a 102.4-s time period. The instrument response was 
corrected. An example of deep tremor and background noise spectra from west Shikoku, Japan, is shown 
recorded at the land-based seismic station also shown in fig. S4. PSD, power spectral density. 
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Fig. 4. Spatiotemporal change of shallow tremor activity. (Top) Hourly event counts (N, number of 
events) and (bottom) space-time plot of shallow tremor along line N-S in Fig. 1. Yellow polygons show the 
location and number of OBS stations. The gray shaded area indicates lower precision for the area outside 
the OBS network. The first and second migrations propagated north-northwestward at 30 to 60 km/day, 
and three RTR events (red shading) propagated in the reverse direction at 100 to 200 km/day. 
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such as the slow slip observed just before the 
Tohoku-Oki earthquake (29). Therefore, monitor- 
ing the spatiotemporal changes of shallow slow 
earthquakes is important for evaluating the slip 
of the shallow plate interface offshore. This aids 
assessment of the potential hazard of tsunamigenic 
earthquakes. Long-term ocean-bottom observations 
of many subduction zones and geodetic observa- 
tions to confirm the suspected shallow SSEs for 
this region will further clarify the relationship be- 
tween shallow slow earthquakes and the frictional 
behavior of megathrusts. 
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Adhesion and friction in mesoscopic 


graphite contacts 


Elad Koren, Emanuel Lortscher, Colin Rawlings, Armin W. Knoll, Urs Duerig* 


The weak interlayer binding in two-dimensional layered materials such as graphite gives 
rise to poorly understood low-friction characteristics. Accurate measurements of the 
adhesion forces governing the overall mechanical stability have also remained elusive. 
We report on the direct mechanical measurement of line tension and friction forces acting 
in sheared mesoscale graphite structures. We show that the friction is fundamentally 
stochastic in nature and is attributable to the interaction between the incommensurate 
interface lattices. We also measured an adhesion energy of 0.227 + 0.005 joules per 
square meter, in excellent agreement with theoretical models. In addition, bistable 
all-mechanical memory cell structures and rotational bearings have been realized by 
exploiting position locking, which is provided solely by the adhesion energy. 


dhesion and friction play a central role in 

mechanical systems because they are direct- 

ly responsible for the energetics, dissipa- 

tion, and wear (J). In particular, nanoscale 

systems are exceptionally susceptible to ad- 
hesion and friction as a consequence of a very 
large surface-to-volume ratio (2). Motivated by 
these issues, research has strongly focused on 
layered materials exhibiting weak layer-to-layer 
coupling, henceforth referred to as interlayer bind- 
ing. Carbon-based materials are a particularly prom- 
ising material class in this respect. A general 
phenomenon associated with layered materials 
is the strong suppression of sliding friction and 
stiction forces, termed superlubricity (3-5). Super- 
lubricity arises from a structural mismatch be- 
tween rotated atomically flat surfaces. As a result, 
shear displacements in mesoscale structures that 
are cut along a glide plane spontaneously revert 
to the initial equilibrium position, solely driven 
by interfacial adhesion forces, as soon as the ex- 
ternally applied displacement force is released 
(6, 7). 

In view of the high prospective impact of gra- 
phitic materials, many research groups have studied 
the interlayer binding energy theoretically. How- 
ever, such calculations are challenging because 
of the substantial contribution from van der Waals 
interactions and the localized nature of the 1-orbital 
bonding across the graphite planes (8-12). Cur- 
rent estimates for the interlayer binding ener- 
gy range from 0.14 J m™ (24 meV per atom) to 
0.34 J m™ (56 meV per atom). Experimental data 
for the adhesion energy of graphite are scarce, and 
measurements rely on indirect methods building 
on the exfoliation of multiwalled carbon nano- 
tubes, o = 0.21 + 0.06 J m” (13); the flexure me- 
chanics of thin graphite sheets, o = 0.19 + 0.01J m7 
(14); or measurements of the desorption energy 
of aromatic molecules, o = 0.32 + 0.03 J m”? (15). 
We distinguish between the theoretically calcu- 
lated binding energy (pertaining to the separation 
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of two surfaces that initially are in perfect atomic 
registry) and the experimental adhesion energy 
(relating to the separation of two surfaces from an 
equilibrium contact position for which the lat- 
tices do not necessarily match). 

Adhesion and friction in layered materials in 
general, and in graphite specifically, are not well 
understood. Here, we report on an experimental 
study of the interface adhesion and friction in 
mesoscopic structures made of highly oriented 
pyrolytic graphite (HOPG). We exploit the unique 
nanopositioning accuracy and force sensitivity of 
scanning probe microscopy to repeatedly shear 
individual nano-sized graphitic mesa structures 
and to measure the applied shear forces during 
their mechanical exfoliation along a single basal 
glide plane. The shear force is composed of a re- 
versible displacement force and a smaller irrevers- 
ible friction force characterized by a force hysteresis. 
This strategy allows us to investigate the funda- 
mental mechanisms leading to friction and to relate 
the reversible displacement force to the adhesion 
interaction between the sliding surfaces. In addi- 
tion, the displacement force characteristics can 
be engineered by the mesa design outline to yield 
multiple local energy minima at which the sheared 
structures can be arrested. 

The studies were performed on an atomic force 
microscopy (AFM) setup under ambient condi- 
tions. Samples featuring cylindrical and more com- 
plex shaped mesa structures with a typical height 
of 50 nm were fabricated from high-quality HOPG 
substrates by means of reactive ion etching, using 
structured Pd-Au metal layers as self-aligned shad- 
ow masks (Fig. 1A) (7, 14). For mechanical shear- 
ing, a Pt/Ir metal-coated AFM tip was brought into 
contact with the metal mask on top of the mesa 
(Fig. 1C). The tip apex was cold-welded to the 
metal by applying a force and electrical current 
pulse of 50 nN and 1 mA, respectively, for 1s. Asa 
result, a strong mechanical contact between the 
tip and the metal mask formed, which allowed us 
to apply lateral shear forces of up to 200 nN and 
to induce a shear glide along a single basal plane 
in the HOPG structure. After sliding, the exposed 
basal plane surface was free of observable defects, 
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as confirmed by AFM inspection of sheared 
mesas (Fig. 1E). 

We measured the lateral shear force required 
to move the tip parallel to the mesa plane at zero 
applied normal force (Fig. 2). The observed force 
signal has several distinct features: (i) The lateral 
force changes sign when the tip passes from the 
left position through the center and then to the 
right position, reaching the same average plateau 
value of Fp = 45 nN. (ii) In the plateau regions, 
the lateral force is essentially independent of the 
scan direction and position. (iii) The measured 
force signal is highly reproducible throughout 
repeated lateral scans. From (i) and (ii), we con- 
clude that the lateral shear force is dominantly 
conservative in nature; from (iii), we conclude that 
the sheared interface geometry is stable through- 
out the experiment and the manipulations are 
nondestructive. 

Starting from the center position toward the 
right or left side, the lateral force builds up at a 
rate of ~10 nN nm” until it saturates at the plateau 
value Fp. In this initial phase, the upper section 
of the mesa, termed the mobile section, remains 
fixed until sufficient shear stress is built up in the 
mesa to trigger yielding at a glide plane. Sliding 
of the mobile section sets in at a tip displacement 
of ~5 nm, after which the tip displacement rep- 
resents the actual sliding motion of the mobile 
section (Fig. 2B). The force trace and retrace 
signals exhibit substantial fluctuations and hys- 
teresis (Fig. 2B, inset), similar to those observed 
in the sliding friction of carbon nanotubes (16) 
and in superlubricity sliding of nanoscale flakes 
(4) for a rotational mismatch of greater than 5° 
between the sliding surfaces. 

Conservative displacement forces are strictly a 
function of the displacement 2, whereas dissipa- 
tive friction forces depend on the direction of mo- 
tion. The overall energy dissipation for a closed 
path of length ¢ is given by 


a 
Eas = |)Fude (1) 


where the friction force is given by the difference 
of the measured shear force at a position x be- 
tween trace and retrace sliding, F,,(”) = Fs trace(@) - 
Fyetrace(@). An apparent correlation appears as 
high or low friction force values persist over 
sliding distances of 10 to 50 nm (Fig. 3A). We 
also observed unusual behavior, such as short- 
lived amplitude spikes (up to 10 nN) and neg- 
ative friction force over some sliding paths. The 
latter feature indicates energy recovery during 
such phases. 

The values of the friction force are Gaussian- 
distributed, with a mean value of 1.6 nN and a var- 
jance of og = 2.45 nN (Fig. 3C). From the central 
limit theorem, we conclude that the friction force 
arises from statistically independent events with 
finite correlation lengths. Therefore, we expect 
that the variance of the distribution of the friction 
force averaged over a sliding interval Av should 
converge to a zero value, provided that enough 
independent events are probed in the sliding in- 
terval. On the other hand, correlation manifests 
itself as a persistence of the variance if the averag- 
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ing interval Av is smaller than the correlation 
length. As anticipated from the intuitive inter- 
pretation of the friction force data (Fig. 3A), the 
variance decays as expected for statistically in- 
dependent events if the averaging interval is 
larger than ~20 nm (Fig. 3D). By averaging over 
a sliding distance of 90 nm, we obtain a robust 
number for the mean friction force of 1.6 nN 
(lo uncertainty of 0.3 nN). 

From the measured interface energy o ~ 
0.227 J m™ (see below), we estimate the load 
force to be on the order of P = (6/c)(nr”) = 24 uN 
for a cylindrical mesa with a radius of r = 100 nm 
and assuming an interaction length c ~ 0.3 nm, 
on the order of the c-axis lattice spacing. From 
this estimated load, we obtain an effective fric- 
tion coefficient of p ~ 7 x 10°, which is more than 
three orders of magnitude lower than the typical 
friction coefficient u ~ 0.1 to 0.3 observed in dry 
sliding contacts. 


A 


Metal lift-off etch-mask 


Slide Rotation 


Theoretical arguments and simulations pre- 
dict that the friction force does not scale with the 
contact area.A (17, 18). The effect originates from 
a cancellation of forces acting on the interface 
atoms except at the periphery. The exponent y in 
the scaling equation Fy, <A’ can assume values 
in the range from 0 to 0.5, depending on the type 
of lattice mismatch and the shape of the sliding 
object (18, 19). Recently, fractional scaling has been 
confirmed in an experiment that measured the 
sliding friction forces of amorphous antimony 
and single-crystal gold islands on a graphite sur- 
face (20). For the amorphous islands, an expo- 
nent of y = 0.5 was measured, as predicted by 
theory for a random interface. For the single- 
crystal Au islands, an exponent of y = 0.33 + 0.15 
was obtained. We find y = 0.35 + 0.05, in agree- 
ment with this latter value (Fig. 3B). 

The friction force can also be written as Fy, = 
FyN’, where Fo is the mean force acting on an 


Laser 
Photodiode 


Pd/Au 


Bi-stable 


geometry 


FL =o dA/dx 


Mesa section 
fixed to substrate 


Fig. 1. Experimental procedures. (A) Metal masks consisting of Pd (10 nm) and Au (15 nm) are 
fabricated on the freshly cleaved surface of a HOPG sample by electron beam lithography and liftoff 
techniques. The mesa structures emerge during a dry oxygen plasma etch, which selectively thins down 
only the unprotected HOPG surface by 50 nm. The mesa structures are sheared along a basal glide plane 
by applying a corresponding force to the top metal surface. (B) The adhesion energy is determined by 
measuring the line tension force F, acting on sheared cylindrical mesas. Stabilization of a rotation axis is 
possible, allowing the rotation of the mesa around the cylinder axis, while a dumbbell structure provides 
multiple stable equilibria whenever cylindrical sections overlap. (©) Schematic of the AFM experiment. A 
Pt/Ir tip is cold-welded to the metal mask on top of the mesas. Force is applied by a shear motion, and the 
shear force is measured via the induced cantilever torsion. (D) Scanning electron microscopy image of 
cylindrical mesa structures with a radius of 200 nm and etch depth of 50 nm. (E) AFM image of a 
completely sheared cylindrical mesa (100 nm height range mapped to nonlinear color scale). The mesa 
was sheared at a basal plane 10 nm above the substrate surface, and the top section was put down on the 
substrate surface to the right of the original mesa. The tip contact point close to the center is visible as a 
small hillock on the Au top surface. 
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individual atom and Nis the number of interface 
atoms. We find Fy = 1473°pN (22) (Fig. 3B), in 
good agreement with the value of Fy ~ o/(cn) = 
20 pN estimated from the measured adhesion 
energy with an atomic density at the interface (7) 
of 38.2 nm~. The results suggest that the friction 


force originates from a genuine interaction be- 
tween the rotationally misaligned graphite lat- 
tices at the sliding interface. This is remarkable 
because it is a well-known empirical fact (20), also 
predicted theoretically (17, 22), that fractional 
scaling is an extremely fragile interface property 
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Fig. 2. Shear force measurements. (A) Time traces of the tip displacement x with respect to the center 
position (C) and the measured shear force Fs acting on the top surface of a cylindrical mesa with a radius 
of 100 nm. The shear force changes sign in accordance with a displacement to the right (R) or to the left (L) 
with respect to the center position, but not in accordance with the scan direction. (B) Measured shear 
force versus tip displacement. Black and blue curves denote the scan direction from left to right and from 
right to left, respectively, as indicated by the arrows. Inset: Enlarged view of the shear force for a trace and 
retrace scan showing the nonreversible friction contribution. 
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Fig. 3. Dissipative friction forces. (A) The friction force is defined as the difference of the shear force at 
a position x between trace and retrace sliding. The data represent a concatenation of five different friction 
force curves recorded over a sliding distance of 90 nm for mesa structures with a radius of 100 nm. 
(B) Mean value of the friction force versus cross-sectional area mr or number of interface atoms N = 
38.2 nm? x nr* determined from 90-nm trace-retrace curves of circular mesa structures with a radius r 
from 100 to 250 nm. Dashed line is a least-squares fit to the data with (F;,) = FoN”, with Fo = 14 pN and y = 
0.35. Error bar indicates the variance of, = 0.3 nN of the data determined from (D) for Ax = 90 nm. (C) 
Histogram of the distribution of the friction force recorded at a sampling interval of Ax = 0.05 nm and a 
Gaussian fit (dashed line) with a mean value of (F‘-) = 1.6 nN and variance of of = 2.45 nN. (D) Variance of; of 
the friction force distribution as a function of the averaging interval Ax. (E) Power spectral density (PSD) 
determined from the experimental friction force data shown (A). The data reveal a f?° scaling (dashed 
line). The inset shows the PSD as a function of period in the range from 0.1 to 0.5 nm. 
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that can only occur if the lattice interaction is not 
perturbed by defects or contaminations. 

Numerical simulations of adhesion interaction 
between two circular graphene sheets with radii 
of 4 to 15 nm show that a moiré superstructure, 
attributable to the lattice misorientation in the 
rotated sheets, is the root cause for the fluctuating 
shear forces (27). The superstructure is isomorphic 
to the graphene lattice and consists of domains 
of approximate AB and AA stacking correspond- 
ing to atomic and hollow sites in the graphene 
lattice, respectively. The lattice period of the su- 
perstructure is given by 


a 
Le (2) 
V2-2cos ® 
where a = 0.142 nm denotes the in-plane lattice 
constant of graphite and © is the rotation angle. 
The moiré pattern is shifted roughly perpendic- 
ular to the slide direction by one lattice constant 
L for a lateral displacement of the graphene sheets 
by one lattice unit a. As a result, the shear force 
exhibits a multitude of periodic maxima and min- 
ima, with periods on the order of 0.2 to 0.4 nm, 
whenever stacking domains enter or leave the 
overlap area at the boundary (fig. S3A). According 
to the Tomlinson mechanism, such short-period 
force fluctuations are responsible for energy dis- 
sipation and friction in the system. The ampli- 
tude of the force fluctuations scales with the overlap 
area approximately to the power of 0.35, similar 
to the friction force scaling observed in the ex- 
periment, and scales with the rotation angle to 
the power of -1.5 (fig. S4). From the simulations 
we conclude that the mean rotational misalign- 
ment between the sliding surfaces in the experi- 
ment must be between 5° and 10° which gives 
the correct order of magnitude of fluctuations of 
the shear force in relation to the experimental 
friction force values (fig. S4B). 

In a real experiment, sliding is performed 
using a compliant actuator, and hence neither 
the rotation angle ® nor the off-axis y position 
are strictly constant, which profoundly affects 
the sliding friction (23, 24). Allowing the top slid- 
ing surface to follow a path in the x-y-® space 
that locally minimizes the interface energy leads 
to a hidden pseudo-random walk trajectory in 
the y-® space and corresponding correlations. In 
the model simulation, we also included thermal 
activation by allowing the system to choose among 
all path energy options, with probabilities propor- 
tional to the corresponding Boltzmann factors. 
The surprising result of the simulation is that the 
character of the force fluctuations changes from a 
periodic pattern (fig. S3A) to a highly random spiky 
pattern (fig. S6A). This pattern is strikingly simi- 
lar to the measured friction force (Fig. 3A). Rota- 
tional degrees of freedom have been proposed as a 
mechanism leading to irregular sharp peaks in 
the friction signal due to the restoration of a high- 
friction state when the sliding surfaces reorient 
into a commensurate state (23). In our case, howev- 
er, the force spikes are not signaling a commen- 
surate state between the sliding lattices, but rather 
a randomly occurring high energy barrier in the 
potential landscape of rotated sliding interface. 
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The correlations arising from the pseudo-random 
walk in the y-® space are reflected in a non-integer 
power law scaling with an exponent of -1.5 of the 
power spectral density determined from the ex- 
perimental (Fig. 3E) and simulated data (fig. S6). 
Interestingly, the lattice structure still shows up 
in the power spectrum at spatial periods in the 
range from 0.2 to 0.5 nm with a prominent peak 
at 0.2 nm, corresponding to the periods obtained 
for a rigid slide—that is, Ay = 0 and A® = 0. The 
fact that we experimentally observe spectral fea- 
tures that can be traced back to the lattice in- 
teraction provides additional evidence that the 
measured friction force is genuinely due to a moiré 
interaction between rotated surfaces at the inter- 
face. It is also clear from the pseudo-random walk 
mechanism that forward and backward slides will 
never follow exactly the same path; this explains 
the huge spread of the observed local friction 
force, which can even assume negative values over 
some slide intervals. 

The dominant conservative part of the lateral 
force is due to the interface energy at the glide 
planes, which gives rise to a line tension force 
opposing the creation of new surface area. The 
component of the line tension force in the slide 
direction is 


F; a 3 
L=29 de (3) 
where the interface tension o denotes the mean 
adhesion energy per unit area, A is the overlap 
area of the mesa sections, and x is the displace- 
ment of the mobile section with respect to the 
initial position. For a mesa with a circular cross 
section of radius 7, we find 


A(x) = 2r? | cos (-E 2) - 


AO 


(4) 


and 


The maximum force Fs9 = 2or that has to be ap- 
plied to initiate sliding must be equal to the line 
tension force at zero displacement x. Therefore, 
Fg scales linearly with 7, as is confirmed by the 
experimental data (Fig. 4A). We obtain an inter- 
face tension of o = 0.227 + 0.005 J m™ (21), which 
is ~15% larger than previously reported exper- 
imental values measured for the interaction 
between graphite sheets in contact (13, 14). The 
value is in excellent agreement with the mean 
interface energy calculated from the model of 
Kolmogorov et al. (9, 21). 

To validate the interface energy picture, we 
measured the force characteristics for a complete 
slide from perfect overlap at 2 = O nm to zero 
overlap at v = 27 (Fig. 4B). The measured force 
curves for the cylindrical mesas are perfectly pre- 
dicted by Eq. 5 using our experimental value of o. 
Also shown is the force curve measured for a 
compound mesa structure consisting of a cylin- 
drical section with a radius of 200 nm connected 
to arectangular beam section pointing along the 
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slide direction (Fig. 4B). The first part of the force 
curve is identical to that of a circular structure, 
and for x > 400 nm the force is constant because 
of the linear decrease of the overlap area on the 
rectangular beam section. The slight deviation 
from the predicted characteristics at the cross- 
over is due to the tapered profile in the actual 
device. 

We observed a gradual asymptotic approach 
to the line tension force curves characterizing 
mesa sliding. This shows that the process leading 
to superlubricity is a continuous one, which in 
some cases can extend over a distance of up to 
50 nm, as seen in the force trace of the large 
compound mesa structure (Fig. 4B). On the other 
hand, assuming that the graphite stack is in per- 
fect lattice registry, we would expect that a huge 
breakoff force is required in order to overcome 
the lattice barrier. The binding energies for reg- 
ular AB stacking are on the order of E,p = 0.14 to 
0.34 J m~, yielding a breakoff force on the order 
of A x (Eap/c) A x (0.5 to 1.1 nN nm”). For our 
mesa structures, this is in the range from 1 to 
100 uN, which we never observed. The details 
of the breakoff mechanism are not clear. Initia- 
tion of the glide plane could, for example, be tied 
to stacking fault planes (25) or could involve the 
propagation of local stacking boundary defects 
(26). Because we see no breakoff force, we con- 
clude that the shear strength Y, for basal plane 
sliding in graphite mesostructures is solely gov- 
erned by line tension forces, Ys = F,,/A = o(d log 
A/dzx), and therefore it depends on the structure’s 
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geometry rather than being a generic material 
constant. 

To assure the feasibility of surface force-driven 
actuation, we must require that the friction forces 
are negligible with respect to the line tension 
forces; that is, max Fy, < Fy. The friction forces 
scale as the radius 7 of the structure to the power 
of 2y. A safe bound for the maximum of the fric- 
tion force in accounting for the substantial fluc- 
tuations is provided by a 6o,, criterion, yielding 
an estimated maximum friction force of max Fy, ~ 
605,(To)[(7/ro)""1, where 79 denotes the radius of a 
characterized reference structure. With F;, from 
Eq. 5, we obtain a lower cutoff radius of 


1 
s eo i) 1 -2y (6) 
ae fo) 
Substituting the measured values from the ex- 
periment, 7p = 100 nm, o,,(79) = 2.45 nN, y = 0.35, 
and o = 0.227 J m™, we conclude that line ten- 
sion forces dominate friction forces down to struc- 
ture sizes with a radius of ~2 nm. This result is 
encouraging in view of the technical feasibility of 
graphene-based nanomechanical devices. How- 
ever, structures with dimensions on the order of 
tens of nanometers would still be required in 
order to guarantee low-energy dissipation actu- 
ation in line with the low value of the mean fric- 
tion force. 
Determining the underlying theory to calcu- 
late intrinsic adhesion energy and forces allows 
us to engineer the structure geometry with respect 
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Fig. 4. Conservative forces due to adhesion energy. (A) Shear force at onset of mesa sliding Fs as a 
function of the radius of cylindrical mesas. Standard deviation indicated by error crosses (21). Three 
different cantilevers (tips A, B, and C) were used. Dashed line corresponds to a weighted least-squares fit 
to the data, yielding o = 0.227 J m~ with a variance of 0.005 J m™. (B) Measured shear force Fs as a 
function of the lateral displacement x of the mobile top mesa section for different mesa structures and 
different cantilevers. Solid lines show the calculated shear force according to Eq. 5. The dotted line in the 
graph of the compound structure shows the extrapolation of the shear force for the cylindrical part only. 
(C) Dumbbell mesa structure with a height of 80 nm before and after sliding to a stable equilibrium 
position where the circular sections overlap. The glide plane is at a height of 70 nm above the substrate 
surface. (D) Calculated adhesion energy for the dumbbell mesa structure. 
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to a targeted energy and corresponding displace- 
ment force profile. For example, structures fea- 
turing multiple local energy minima displacement 
profiles act as metastable geometric traps. Switch- 
ing between the trap states involves only minimal 
frictional resistance and virtually zero wear. The 
mechanical transition between local minima can 
be realized by different means such as mechan- 
ical, electrostatic, and magnetic actuation or by a 
thermal gradient mechanism (27). Multiple ener- 
gy minima positions are obtained, for example, 
in a dumbbell structure (Fig. 4, C and D). The 
AFM tip is used to first induce a glide plane de- 
fect and then direct the mobile upper part from 
the initial energy minimum position (Fig. 4C, left 
inset) to the other local energy minimum posi- 
tion where the cylindrical sections overlap (Fig. 
4C, right inset). After actuation, the tip is released 
from the metal surface by applying a negative load 
of >200 nN, and the actuated mesa is imaged in 
the tapping mode. Note that the glide plane sur- 
face of the fixed bottom mesa is clearly visible in 
the AFM image. Adhesion guiding combined with 
ultralow-friction properties can also be exploited 
for stabilizing rotational structures (fig. S1) and 
thus provides a versatile engineering platform for 
nanoscale devices in general. 
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SOLAR CELLS 


Impact of microstructure on local 
carrier lifetime in perovskite 


solar cells 


Dane W. deQuilettes,’ Sarah M. Vorpahl,' Samuel D. Stranks,”* Hirokazu Nagaoka,* 
Giles E. Eperon,” Mark E. Ziffer,’ Henry J. Snaith,” David S. Ginger't+ 


The remarkable performance of hybrid perovskite photovoltaics is attributed to their long 
carrier lifetimes and high photoluminescence (PL) efficiencies. High-quality films are 
associated with slower PL decays, and it has been claimed that grain boundaries have a 
negligible impact on performance. We used confocal fluorescence microscopy correlated 
with scanning electron microscopy to spatially resolve the PL decay dynamics from films of 
nonstoichiometric organic-inorganic perovskites, CH3NH3Pbl3(Cl). The PL intensities and 
lifetimes varied between different grains in the same film, even for films that exhibited long 
bulk lifetimes. The grain boundaries were dimmer and exhibited faster nonradiative decay. 
Energy-dispersive x-ray spectroscopy showed a positive correlation between chlorine 
concentration and regions of brighter PL, whereas PL imaging revealed that chemical 
treatment with pyridine could activate previously dark grains. 


s active layers in solar cells, organic-inorganic 

perovskites (J, 2) combine the promise of 

solution processing (3, 4) with the ability 

to tailor the band gap through chemical 

substitution (5-7), yielding solar cell power 
conversion efficiencies as high as 20.1% (8). Con- 
comitant with their photovoltaic performance, 
perovskites also exhibit high fractions of radia- 
tive recombination, with apparent carrier lifetimes 
of 250 ns or longer (9, 10), and are challenging 
the dogma that solution-processed semiconductors 
inevitably possess high densities of performance- 
limiting defects. Ensuring that all recombination 
is radiative is critical for approaching the ther- 
modynamic efficiency limits for solar cells and 
other optoelectronic devices (11). 

Carrier recombination lifetimes measured by 
photoluminescence (PL) are commonly taken as 
a hallmark of perovskite film quality, with longer 
decay lifetimes used as indicators of better- 
performing materials (9, 10, 12-14). Carrier re- 
combination kinetics have been described as a 
combination of trap-assisted, monomolecular (first- 
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order), and bimolecular (second-order) recom- 
bination (15). Although most studies agree that 
radiative bimolecular recombination dominates 
at high initial carrier densities (n> > 10" cm~*) 
(15-18), reports of kinetics at lower excitation 
densities (and relevant to solar cell operation) 
(19) range from single-exponential (72, 20), to 
biexponential (73, 14), to stretched-exponential 
(6, 9) functions with varying levels of fidelity. 
These distributions have in turn been explained 
in terms of unintentional doping (27) or charge 
trapping (22). The perovskite growth conditions 
(3, 4, 10) and post-deposition treatments (72, 23) 
can greatly alter film morphology, carrier lifetime, 
and device performance, yet the underlying rela- 
tions between these parameters are important 
open questions. For instance, perovskite films 
grown from nonstoichiometric mixed halide 
(Cl/D precursor solutions have exhibited lifetimes 
of hundreds of nanoseconds, but PL lifetimes in 
films grown from chloride-free precursors are 
generally much shorter (9, 20). 

Correlated confocal PL and scanning electron 
microscopy (SEM) have been a powerful tool to 
reveal structure/function relationships in biol- 
ogy (24). We applied similar techniques to study 
structure/function relationships in perovskite films. 
We found substantial local PL heterogeneity even 
for CH;NH3PbI,(Cl) films with average lifetimes 
of ~1 us (comparable to the longest lifetimes re- 
ported) (9, 10), which suggests that considerable 
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scope remains for reducing nonradiative recom- 
bination in these films. In addition to observing 
entire grains that appear dark, we also observed 
that grain boundaries are associated with PL 
quenching, indicating that they are not as benign 
as has been suggested previously (25, 26). We 
further used PL microscopy to show that post- 
deposition chemical treatments can activate pre- 
viously dark regions in the film, and we correlated 
local energy-dispersive x-ray spectroscopy (EDS) 
with confocal fluorescence maps, finding that 
brighter grains with longer lifetimes were asso- 
ciated with local spikes in Cl concentration. 

We studied CH;NH3PbI,(Cl) films prepared on 
glass slides by spin-coating a nonstoichiometric 
mixed halide precursor solution composed of 
CH;NHzgI and PbCl, (3:1) in N,N-dimethylformamide 
(19, 27). Films prepared under identical condi- 
tions and incorporated into standard solar cell 
device architectures (fig. S1) (19) exhibit power 
conversion efficiencies (n) up to 14.5% (Fig. 1A), 
which is comparable to efficiencies in other re- 
ports using this architecture (2, 28). Figure 1B 
shows that our PL lifetimes are as long as those 
reported for films used in the best devices to date 
(0). These films exhibited average carrier life- 
times >1000 ns when excited at low intensity 
(30 nJ/em?, np ~ 10" cm). At short times (Fig. 
1B, inset), the PL decay could appear nearly single- 
exponential, but at longer times, the decay de- 
viated from a single-exponential decay (6, 9, 15). 
We fit the decay in Fig. 1B with a stretched- 
exponential function of characteristic lifetime t, = 
431 ns and distribution parameter B = 0.57, which 
we interpret as arising from a superposition of 
exponential relaxation functions (see below) with 
an average lifetime of <r> = 1005 ns (19, 29). 

Green and co-workers recently examined mi- 
croscopic PL quenching of discontinuous perov- 
skite islands with n- and p-type capping layers 
(30). Here, we used fluorescence microscopy to 
probe the inherent decay properties of neat semi- 
conducting films. Figure 1 shows a correlated 
SEM micrograph (Fig. 1C), confocal PL image (Fig. 
1D), and an overlaid SEM/PL microscopy image 
(Fig. 1E) of a high-performing perovskite film on a 
glass substrate. Although this film appears con- 
tiguous (Fig. 1C) and exhibits <z> = 1005 ns, Fig. 
1D shows a large distribution in local PL intensity 
across the film. We observed these large distribu- 
tions in films prepared in different research labs 
(fig. S2D) (19), and we exclude variations in film 
thickness (fig. S2) (9) and photodegradation 
during imaging (fig. S3) (19) as primary causes. 
The PL intensity not only varied from grain to 
grain, with roughly 30% of grains imaged in Fig. 
1C consisting of dark grains (79), but we also ob- 
served ~65% lower PL intensity at grain bounda- 
ries (fig. S4, A to C) (19), after deconvolution of 
the microscope point-spread function (fig. $5) (29). 
These results are surprising because, through con- 
siderations of detailed balance (JI, 31), one ex- 
pects high-performance films to have minimal 
nonradiative decay. 

Instead, the spatial variations in PL intensity 
in the polycrystalline perovskite films are sug- 
gestive of variations in local nonradiative decay 
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Fig. 1. Solar cell device measurement, bulk PL lifetime measurement, and correlated images from 
(SEM) and fluorescence microscopy experiments. (A) Light current-voltage (J-V) characteristics of a 
high-performing mixed halide perovskite solar cell. (B) Bulk time-resolved PL decay trace of CH3NH3Pbl3(Cl) 
perovskite film on glass after excitation at 470 nm, 125 kHz, 30 nJ/cm? (no~10"°cm°) and fitted to a stretched- 
exponential function with <t> = 1005 ns, (1. = 431 ns, 8 = 0.57), with nearly single-exponential dynamics at short 
times (inset). (C) Correlated SEM micrograph, (D) fluorescence image, and (E) composite image showing 
significant variations in PL intensity across different grains and grain boundaries. 
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Fig. 2. Fluorescence microscopy of CH3;NH3PbI3(Cl) film and local PL measurements. (A) A 3 um—by—-3 um 
fluorescence image of the perovskite film with bulk lifetime <t> = 1010 ns (t_ =433 ns, B = 0.57). (B) Relative steady 
state PL spectra of bright (red square) and dark (blue circle) regions. (C) Time-resolved PL decay curves of bright 
(red square) and dark (blue circle) regions after excitation at 470 nm, 125 kHz, @ = 1uJ/em? (no ~ 5 x 10 cm), (D) 
= 21 pJ/em? (ng ~ 1 x 10%), and (E) bright region measured at ~ = 2.1 1J/cm? versus dark region measured 
at p = 3.4 pJ/cm? (no ~ 1.6 x 10cm), showing that dark regions require higher initial carrier densities to 
exhibit kinetics dominated by bimolecular recombination. Black traces are simulations to the data (19). 
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rates. By taking local steady-state and time-resolved 
PL data, we confirmed that darker regions have 
greater nonradiative loss. Figure 2 shows a con- 
focal PL image (Fig. 2A) along with local PL 
spectra (Fig. 2B) and lifetime data (Fig. 2, C to E) 
from a film with a long average bulk lifetime 
(<tr> = 1010 ns). Figure 2B shows the steady- 
state spectra of a bright (red square) and dark 
(blue circle) region. The PL spectrum collected 
at the dark region is both red-shifted (~2 nm) 
and slightly broader than the bright region (fig. 
S6) (19). These trends suggest a less sharp band 
edge (32), probably caused by the presence of 
defect states or shallow trapping levels in the 
darker regions. In Fig. 2, C to E, we show local 
PL decays of the indicated dark and bright re- 
gions at low (1 wJ/cm?”), medium (2.1 pJ/cm?), and 
high (3.4 uJ/cm?) excitation fluences. Several 
studies have reported a transition from trap- 
assisted monomolecular recombination to free- 
carrier bimolecular recombination over this fluence 
range (15, 18). Consistent with the picture that 
bright regions have fewer nonradiative pathways, 
bright regions show a slower decay, a transition 
to bimolecular recombination-dominated kinetics 
at a lower excitation fluence, and more efficient 
PL quenching when contacted by fullerene (fig. 
S7) (19) in comparison to dim regions. We mod- 
eled the PL dynamics (black lines in Fig. 2, C to E) 
as a combination of trapping, monomolecular, 
and bimolecular recombination (19). We report a 
higher deep trap-state density in the dark region 
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(4 x 10'° cm™) as compared to the bright region 
(<1 x 10° cm”). In addition, we extracted the 
trapping, monomolecular, and bimolecular de- 
cay rates in both regions to be 1 x 107° cm? s“¢, 
1x 10° s"}, and 2.3 x 10°" em? s/ to 7.8 x 10°" 
cm? s"1, respectively. We also report consistent 
ratios of PL intensity measured across the steady 
state and integrated time-resolved PL measure- 
ments (fig. S8) for bright and dark regions (19). 

The local PL lifetimes are also shorter at grain 
boundaries (fig. S4E) (79). Grain boundaries fre- 
quently serve as nonradiative recombination cen- 
ters in polycrystalline semiconductor films (33). 
Studies have suggested that grain boundaries in 
perovskites are less detrimental than in other 
semiconductors (25, 26), or even beneficial (34). 
Other results suggest that single-crystal perovskites 
exhibit even higher performance (35-37), and some 
describe improvements in carrier lifetime and de- 
vice performance from post-growth treatments, 
such as exposure of the film to pyridine (C;H;N), 
in the context of surface passivation (12, 23). 

In this context, we next show that pyridine va- 
por exposure can brighten dark domains. In Fig. 3, 
A and B, we show the PL from a CH3NH3PbI,(Cl) 
film before and after exposure to pyridine vapor. 
The entire film was both brighter (~8x integrated 
over the entire image) and more uniform after 
pyridine exposure (no enhancement without pyr- 
idine, fig. S9) (19). For instance, the relative in- 
crease of a dark domain (squares in Fig. 3, A and 
B), was 180% larger than the relative increase of 
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Fig. 3. Fluorescence microscopy of CH3NH3PbI3(Cl) film with pyridine vapor treatment. (A) Flu- 
orescence image before and (B) after treatment showing activation of the CH3NH3Pbl3(Cl) film. (€) Bulk 
steady-state PL spectra showing the relative PL intensities before (blue circle) and after (red square) 
treatment (inset) and normalized spectra showing a slight blue shift and narrowing of full width at half 
maximum after treatment. (D) Grain boundary PL line scan before [blue line in (A)] and after [red line in (B)] 
treatment, showing slight relative reduction in PL quenching across the grain boundary after treatment. 
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a bright domain (circles). The PL emission also 
blue-shifted (by ~3 nm) and narrowed slightly 
(Fig. 3C) after pyridine exposure, which could be 
caused by a reduction in shallow trap density. 
Finally, Fig. 3D shows that the grain boundary 
brightness, relative to the surrounding grains, 
increased by 11% and the width decreased by 25% 
after pyridine exposure (see fig. S10 for other 
examples) (19). Both trends are consistent with 
passivation, albeit incomplete, of defects at grain 
boundaries. Although pyridine treatment can also 
result in some restructuring of film morphology 
(fig. S10) (79), these data suggest that pyridine 
was indeed remediating nonradiative defects in 
the perovskite film. 

Finally, we explored the role of Cl in perovskite 
films by comparing SEM/EDS elemental compo- 
sition traces with local PL intensity traces. Figure 4A 
shows an overlaid SEM/PL microscopy image of 
a CH3NH3PbI,(Cl) film on glass. We tracked the 
changes in PL intensity with Cl content (Fig. 4B) 
and showed that bright regions correlate with 
areas of higher relative Cl content (C1/Cl+D (trace 
Cl in CH,NH3PbI; control films, fig. $11) (79). Al- 
though films prepared from Cl/I mixed halide 
precursors stoichiometrically resemble triiodides 
(38), there is evidence for residual Cl at levels of 
~2 weight % or less (13, 39). The lifetime enhance- 
ment in the presence of Cl is consistent with 
recent findings that Cl-rich nucleation sites lead 
to better crystal coalescence (40, 41) and helps 
explain why films grown in the presence of Cl have 
slower recombination rates (9, 17). We hypothe- 
size that Cl could be present at the surface or 
within the crystals, interstitially or substitution- 
ally, or simply at the substrate surface as a resid- 
ual but unincorporated component left over from 
the seeding of low-defect crystallites. We performed 
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Fig. 4. Correlated images and line scans of 
CH3NH3PbI,(Cl) film using fluorescence micros- 
copy, SEM, and EDS. (A) SEM micrograph over- 
laid on fluorescence image and (B) EDS line scan 
showing that the local elemental weight ratio of 
CI/(Cl+l) tracks areas of higher integrated PL inten- 
sity, indicating that Cl is associated with better- 
performing grains. 
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time-of-flight secondary ion mass spectrometry 
(ToF-SIMS) (fig. S11F) (29) and found higher 
Cl content in CH;NH3PbI;(Cl) films than in 
CH3NH3PbI, films without Cl. This technique 
probes the top 2 nm of the film. 

Although perovskite solar cells have better ra- 
diative efficiencies than many other types, such 
as dye-sensitized, organic, or even cadmium tel- 
luride solar cells, they still suffer from greater non- 
radiative losses than inorganic materials such as 
gallium arsenide and are only at present approach- 
ing the radiative efficiencies of copper indium 
gallium selenide (CIGS) (37). Our results identify 
a subpopulation of dark grains and grain bound- 
aries as specific targets for perovskite growth and 
passivation studies, and show that local fluores- 
cence lifetime imaging provides a route by which 
changes in film processing can be evaluated to 
assess their influence on carrier recombination 
in films. By removing these nonradiative path- 
ways to obtain uniform brightness with high 
emissivity across all grains, it is likely that we will 
see the performance of perovskite devices approach 
the thermodynamic limits for solar cells and 
other light-emitting devices. 
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CATALYSIS 


Identification of molybdenum oxide 
nanostructures on zeolites for natural 


gas conversion 


Jie Gao,’ Yiteng Zheng,’ Jih-Mirn Jehng,”? Yadan Tang,” 


Israel E. Wachs,”* Simon G. Podkolzin™* 


Direct methane conversion into aromatic hydrocarbons over catalysts with molybdenum 
(Mo) nanostructures supported on shape-selective zeolites is a promising technology for 
natural gas liquefaction. We determined the identity and anchoring sites of the initial Mo 
structures in such catalysts as isolated oxide species with a single Mo atom on aluminum 
sites in the zeolite framework and on silicon sites on the zeolite external surface. During 
the reaction, the initial isolated Mo oxide species agglomerate and convert into carbided 
Mo nanoparticles. This process is reversible, and the initial isolated Mo oxide species can 
be restored by a treatment with gas-phase oxygen. Furthermore, the distribution of the Mo 
nanostructures can be controlled and catalytic performance can be fully restored, even 


enhanced, by adjusting the oxygen treatment. 


ethane (CH,), the main component of 
natural gas, has the highest H-to-C ratio 
of all hydrocarbons; therefore, it is more 
environmentally friendly in terms of CO. 
emissions than oil or coal-derived fuels. 
However, 30 to 60% of natural gas reserves are 
classified as “stranded” because shipping gas is 
not economical, and the costs of liquefaction or 
building a pipeline are usually prohibitively high 
(1-5). The problem of natural gas utilization is ex- 
acerbated by burning and venting of the associated 
gas produced in the course of crude oil production 
at remote locations. Conversion of methane into 
shippable liquids can solve these problems but 
remains scientifically challenging (7-3, 6-8). 
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One of the technologies under development is 
direct methane conversion into liquid aromatic 
hydrocarbons in a single step (dehydroaromati- 
zation with the main reaction 6CH, — CgHg + 
9H.) using catalysts with Mo nanostructures 
supported on shape-selective zeolites (2, 8-16). 
This technology offers two advantages over other 
methane activation chemistries: Complete oxida- 
tion, as well as explosive combustion, is not possi- 
ble because of the absence of O, or other oxidizing 
reagents, and processing can be performed at 
remote locations because no reagents are needed. 
The biggest issues in commercialization are rapid 
catalyst deactivation and comparatively low single- 
pass conversion levels of ~10% (2, 8, 13-16). De- 
velopment of improved catalysts has been hindered 
by a lack of molecular-level understanding of the 
identity of the zeolite-supported Mo nanostruc- 
tures and their structural transformations. 

We studied Mo nanostructures supported on 
ZSM-5 zeolites by combining quantum chemical 
calculations using density functional theory (DFT) 
with multiple spectroscopic techniques, includ- 
ing in situ ultraviolet-visible diffuse reflectance 
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spectroscopy (UV-vis DRS), in situ infrared (IR) 
spectroscopy, and operando Raman spectroscopy 
at elevated reaction temperatures with simulta- 
neous online mass spectrometry of reaction pro- 
ducts. We determined the identity and anchoring 
sites of the initial Mo oxide nanostructures and 
established structure-activity relationships. The 
catalytic activity can be fully restored by regener- 
ating initial Mo oxide nanostructures with a gas- 
phase O», treatment. Furthermore, the activity 
can even be enhanced by controlling the distri- 
bution of Mo oxide nanostructures by adjusting 
conditions of such an O, regeneration treatment. 

Molybdenum nanostructures supported on ze- 
olites were initially present in an oxide form after 
Mo deposition and an oxygen treatment at ele- 
vated temperatures (our samples were calcined 
at 773 K) (17). The number of MoO, units in an 
average individual nanostructure was evaluated 
using the edge energy (E,) of the in situ UV-vis 
DRS spectra. The £, values for the following well- 
defined Mo oxide reference compounds are pre- 
sented in Fig. 1A: (i) MoO,-coordinated Mo,-Mo, 
clusters, (ii) linear chains of alternating MoO, and 
MoOg units, (iii) infinite layered sheets of MoO; 
units, (iv) Mo,0, dimer as MoO3-O-MoOs, (v) iso- 
lated MoO, and MoO, monomers, and (vi) aque- 
ous molybdate anions as a function of the solution 
pH (8). The E, values in Fig. 1A exhibit a linear 
correlation with the number of bridging Mo-O-Mo 
covalent bonds around the central Mo cation and, 
correspondingly, with the number of MoO, units 


Fig. 1. Spectroscopic measurements. 
(A) Electronic edge values based on in 
situ UV-vis spectra of reference Mo oxide 
compounds exhibit a linear correlation 
with the number of bridging Mo-O-Mo 
covalent bonds around the central Mo 
cation. The value of 4.8 eV for 2 wt % 
Mo/ZSM-5 (Si/AI = 15) corresponds to 
Mo oxide species with a single Mo atom. 
(B and C) In situ Raman spectra of 
Mo/ZSM-5 catalysts under oxygen flow 
at 773 K as a function of (B) Mo loading for 
constant Si/Al = 15 and (C) Si/AI ratio for 
constant 1.3 wt % Mo loading with band 
assignments to Mo oxide species based on 
DFT calculations. a. u., arbitrary units. 
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in a nanostructure. The £, value for a representa- 
tive catalyst sample with 2 weight percent (wt %) 
Mo supported on a ZSM-5 (Si/Al = 15) zeolite, 
which is the most common zeolite evaluated for 
methane dehydroaromatization, was 4.8 eV, which 
falls in the range of isolated MoO, nanostructures 
with a single Mo atom. 

The nature of the Mo oxide nanostructures was 
further examined with in situ Raman spectros- 
copy by varying the concentration of Mo from 0.7 
to 3.3 wt % on a ZSM-5 zeolite support with a 
constant Si/Al ratio of 15 (Fig. 1B) and by varying 
the Si/Al ratio from 15 to 140 at a constant Mo 
concentration of 1.3 wt % (Fig. 1C). The spectrum 
for 13 wt % Mo on ZSM-5 with Si/Al = 15 is 
shown in both sets in Fig. 1, B and C, and a similar 
spectrum is shown in operando Raman measure- 
ments with methane flow in fig. S1 (17). The ab- 
sence of sharp Raman bands from crystalline MoO; 
nanoparticles (NPs) at 996, 815, and 666 cm" (19) 
or crystalline Al,(MoO,)3 at ~1004 and 1045 cm™ 
(18, 20) indicates, in agreement with the UV-vis 
results in Fig. 1A, that Mo oxide was completely 
dispersed; any amorphous Mo oxide species would 
crystallize at the elevated pretreatment temper- 
ature of 773 K. Some spectra exhibited weak 
shoulder features at 950 cm™ from Mo oxide 
species in zeolite framework vacancy defects and 
at 1026 cm™’ from Mo oxide species on extra- 
framework alumina NPs (17). 

For the ZSM-5 (Si/Al = 15) zeolite in Fig. 1B, a 
single Raman band at 993 cm“ was observed in 
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the Mo-O stretching region for all Mo concen- 
trations. However, at higher Si/Al ratios in Fig. 1C, 
a new band at 975 cm‘ was observed, and an ad- 
ditional band appeared at 984 cm“ at the highest 
Si/Al = 140 (Fig. 1C). These three bands cannot be 
attributed to a single Mo oxide nanostructure 
because their relative intensities change with the 
Si/Al ratio. To determine the identity and anchor- 
ing sites of these Mo oxide structures in the ZSM-5 
zeolite framework, various monomeric Mo oxide 
species were evaluated with DFT calculations, 
and the calculated normal vibrational modes 
were compared with the experimental Raman 
spectra. 

After calcination at 773 K, Mo was present in 
its highest oxidation state of +6, as evidenced by 
the absence of d-d transitions for reduced Mo in 
the in situ UV-vis spectra. Our DFT calculations 
show that neutral MoO; species on framework Si 
sites are unstable and that framework Al sites are 
required for anchoring (17). This result is in agree- 
ment with changes in the in situ IR spectra for 
surface OH groups as a function of the Mo load- 
ing in fig. $2 (17) that showed preferential elim- 
ination of Bronsted acid sites (H* on [AlO4]) 
after Mo deposition. On a site with two adjacent 
framework Al atoms, the stoichiometry of the Mo 
oxide species should be Mo(=O),”* as dioxo species 
to counterbalance the 2- charge of 2[AlO,] and 
maintain Mo in the +6 oxidation state. The size of 
isolated Mo dioxo species serves as a geometric 
restriction, which determines the acceptable range 
of separation distances between the two anchor- 
ing framework Al-atom sites. Because ZSM-5 is a 
Si-rich zeolite, Lowenstein’s rule prohibits one Al 
atom to be the first neighbor of another Al atom 
in the framework as Al-O-Al. An arrangement of 
Al-O-Si-O-Al with two Al atoms as second neigh- 
bors was not found experimentally, based on 7”Al 
nuclear magnetic resonance (NMR) and addi- 
tional characterization for ZSM-5 samples with 
Si/Al > 8 (27, 22). Finally, an arrangement of Al- 
O-(Si-O).-Al with two Al atoms as third neigh- 
bors must be the only possible double Al-atom 
anchoring sites for Mo dioxo species. Our DFT 
results confirm that two Al atoms as fourth neigh- 
bors in Al-O-(Si-O)3-Al can serve only as two in- 
dividual single anchoring sites (77). 

Although the exact distribution of Al atoms 
among different framework sites in ZSM-5 zeo- 
lites is currently not well understood, it can be 
varied by adjusting the zeolite synthesis proce- 
dure. For example, the number of Al atoms as 
double anchoring sites in the arrangement Al-O- 
(Si-O),.-Al can be varied from 4 to 46% for ZSM-5 
samples with Si/Al = ~20, based on characteri- 
zation with hydrated Co cations (22). The frac- 
tion of Al atoms as double anchoring sites 
typically decreases, but not proportionally, with 
the increasing Si/Al ratio for the same synthesis 
procedure (22). Our evaluation of Al-O-(Si-O).-Al 
arrangements in ZSM-5 shows that these sites 
can serve as double Al-atom anchoring sites if 
they are located in the same channel, but not in 
the same plane. Additional classification of double 
Al-atom anchoring sites is provided in fig. S4 (17). 
A representative Mo(=O),”* dioxo structure on 
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an anchoring site with a pair of Al atoms in T8 
and T12 framework positions is shown in Fig. 2. 
In this nanostructure, the Mo atom is bridge- 
bonded to two framework Al atoms through two 
neighboring framework oxygen atoms and ter- 
minated with two additional oxygen atoms. The 
normal vibrational modes obtained with DFT 
calculations for these terminal oxygen atoms in 
Mo(=0),”* are summarized in Table 1. The sym- 
metric stretch (v,) is calculated to be at 992 cm. 
The calculated geometries and normal vibration- 
al modes for the Mo(=O),”* nanostructure on 
other double Al-atom anchoring sites with two 
bridging framework O atoms are similar (table S2 
and fig. S6) (7). On a site with a single framework 
Al atom, the stoichiometry of Mo oxide species 
should be Mo(=0).(OH)* to counterbalance the 
1- charge of [AlO,] and maintain Mo in the +6 
oxidation state. The vibrational mode for the sym- 
metric stretch of the terminal oxygen atoms in 
these Mo species is predicted to be at 975 cm™, 
based on evaluation of geometries and vibra- 
tional modes of the Mo(=O).(OH)* nanostructure 
anchored on single Al-atom sites in T8 (Table 1) 
and other ZSM-5 framework positions (table S1 
and fig. S5) (7). 

Raman spectroscopy gives rise to strong bands 
of symmetric stretches (v,) and weaker bands of 
asymmetric stretches (v,;), With the latter some- 
times being undetectable. In our previous studies 
of MoO;/SiO,z (19, 20), vas for Mo(=O). was not 
observed for Mo loadings below 4 wt %. Therefore, 
only v, is expected to be observed for lower Mo 
loadings. A comparison of the dominant Raman 
bands at 975 and 993 cm” in Fig. 1 with the 
calculated symmetric stretch values (v,) in Table 
1 (975 and 992 cm’) allowed us to assign these 
bands to two distinct isolated Mo dioxo species 
anchored on, respectively, single and double Al- 
atom framework sites. 

The identification of the isolated Mo oxide 
structures provided insight as to how they were 
affected by the main catalyst formulation param- 
eters: the Mo loading and Si/Al ratio. At a low 
Si/Al = 15, Mo oxide species preferentially anchored 
on sites with two Al atoms (band at 993 cm™ in 
Fig. 1B). Even at the highest Mo loading of 3.3 wt 
%, the Al/Mo atomic ratio is 2.8, which allowed 
all Mo atoms to be anchored on double Al-atom 
sites. However, when the Si/Al ratio increased, 
the number of Al atoms per unit volume of the 
zeolite decreased, and the number of sites with 
two Al atoms should have decreased more rapidly 
than the overall number of Al atoms. As a result, 
at higher Si/Al ratios of 25 and 40 in Fig. 1C, the 
dominant band was at 975 cm, arising from 
Mo(=0).OH species anchored on sites with one 
Al atom. The identification of single and double 
Al-atom anchoring sites is in agreement with 
previous findings that each Mo atom displaces 
one H* from framework [AlO,] sites in ZSM-5 
with Si/Al = 40 and two H* in ZSM-5 with Si/Al = 
15 (23). At the highest Si/Al = 140 shown in Fig. 1C, 
when the corresponding Al/Mo ratio fell below 
unity to 0.8, there were not even enough single Al- 
atom sites for stabilizing all Mo atoms. For this 
catalyst formulation, Mo oxide species were forced 
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to be stabilized, not in the zeolite pores but on the 
least preferable Si sites on the external surface of 
the zeolite. A new band at 984. cm for Si/Al = 140 
in Fig. 1C is consistent with our previous Raman 
spectra for Mo oxide species supported on amor- 
phous SiO, (19, 20). Our DFT calculations con- 
firmed that Mo dioxo species did not stabilize in 
zeolite pores in the absence of Al sites and that 
the structure of isolated Mo(=O), dioxo species 
as (Si-O-)2Mo(=O), on the external surface of the 
zeolite (Fig. 1; full details in fig. S8 and table S4) 
(17) is similar to that on SiO,. These findings are 
also supported by the in situ IR spectra of the 
surface OH region for ZSM-5 (Si/Al = 15) as a 
function of the Mo loading in fig. S2 (17). The 
intensity of the peak at 3608 cm for OH groups 
on framework Al sites (24) decreased through re- 
placement by Mo oxide species at low Mo load- 
ings, followed by a decrease in the intensity of the 
peak at 3745 cm for OH groups on the external 
surface Si sites (24) at higher Mo loadings. The 
isolated Mo oxide structures preferentially anchored 
on double Al-atom framework sites, then single 
Al-atom framework sites, and finally Si sites on 
the external surface of the zeolite. The isolated 
Mo oxide nanostructures anchored on these three 
types of zeolite sites are shown schematically in 
Fig. 1 and with 3D animation in movie S1. 
Dynamic changes of Mo nanostructures under 
reaction and regeneration conditions were eval- 
uated by simultaneously collecting operando 
Raman spectroscopy and online mass spectrom- 
etry measurements, first with CH, flow at 953 to 
1053 K (fig. S1) (17) and then under regeneration 
conditions with gas-phase O, flow at 773 K (figs. 
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$10 and S11) (77). Upon CH, introduction, CO, 
was the only initial carbon-containing product, 
and the Raman band at 993 cm for the isolated 
Mo oxide structures gradually disappeared. Be- 
cause CH, was the only reactant, Mo oxide nano- 
structures reduced to oxycarbide or carbide species. 
Several studies with different techniques, such as 
x-ray absorption fine structure, Mo Ly, edge x-ray 
absorption near-edge structure, and °*Mo NMR, 
provide direct evidence that the reduced Mo 
phase is a carbide with the stoichiometry of MoC,, 
or MoC,O, and that the initial oxide species ag- 
glomerate into particles with a size of ~0.6 nm 
(25-28). After the induction period, CO, forma- 
tion stopped, the Raman band at 993 cm for the 
initial Mo oxide species was no longer observed 
(Fig. 2B and fig. S1) (77), and the catalyst per- 
formed CH, dehydroaromatization with CgH, as 
the main hydrocarbon product. 

Our results demonstrate that an O. treatment 
can reverse both the carbide formation and the 
agglomeration of Mo nanostructures. The Raman 
spectra at 753 K for the initial catalyst with iso- 
lated Mo oxide structures and for the regenerated 
catalyst after reaction in Fig. 2 are similar, with a 
single band at 993 cm“ and ashoulder feature at 
950 cm. The similarity in the Raman band po- 
sitions and intensities before reaction and after 
regeneration indicates that the regeneration con- 
verts carbided Mo NPs into an oxide phase, 
redisperses this phase into isolated oxide nano- 
structures with a single Mo atom, and allows 
these Mo oxide species to diffuse and then stabilize 
on substantially the same zeolite anchoring sites 
as in the initial catalyst before the reaction. 


Fig. 2. Operando Raman 
spectra of 2 wt % Mo/ZSM-5 
(Si/AI = 15). Spectra (A) after 
initial pretreatment with 
gas-phase oxygen, (B) during 
reaction with methane, and 

(C) after regeneration with 
gas-phase oxygen are shown. 
The spectra demonstrate 

that the initial Mo(=O)3°* 
nanostructures anchored on 
double Al-atom framework sites 
(shown schematically on the 
right and in a zeolite pore below) 


7 with a vibrational mode at 


) 993 cm” are recovered after 
regeneration. 
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Effects of regeneration time with O, on the iden- 
tity of Mo nanostructures and on catalytic perform- 
ance with CH, after regeneration were evaluated 
by combining additional Raman spectroscopic 
measurements with reaction testing. Raman 
spectra were collected in O» flow at 773 K for 
two 1.3 wt % Mo/ZSM-5 (Si/Al = 15 and 25) 
catalysts after their deactivation in reaction 
with CH,. The evolution of Raman spectra as a 
function of regeneration time in figs. S10A and 
S11A (17) shows that isolated Mo oxide nano- 
structures were regenerated sequentially. Isolated 
Mo(=O),. species anchored on double Al-atom 
framework sites were regenerated first, as evi- 
denced by a single initial Raman band at 993 cm. 
With increased regeneration time, a second Raman 
band at 975 cm™ caused by Mo(=O).OH species 
anchored on single Al-atom sites appeared and 
grew in intensity. Finally, a third Raman band at 
984 cm™ due to Mo(=O), species anchored on Si 
sites on the external surface of the zeolite ap- 
peared and grew in intensity for the catalyst with a 
lower Al concentration in the zeolite (Si/Al = 25 
instead of 15). These direct spectroscopic observa- 
tions demonstrate that exposure to gas-phase O 
first regenerates isolated Mo oxide nanostruc- 
tures anchored on sites with two Al atoms, then 
forces these species to migrate to sites with one Al 
atom and, eventually, to Si sites on the external 
surface of the zeolite. 

A comparison of CgH, formation rates in CH, 
conversion as a function of time on stream for a 
fresh 1.3 wt % Mo/ZSM-5 catalyst (Si/Al = 25) 
versus the same catalyst after deactivation in the 
reaction with CH, and subsequent regeneration 


for 120 min (Fig. 3A) demonstrates that the cat- 
alytic performance can be fully restored. The 
C.H, formation rates after regeneration matched 
those for the fresh catalyst. Additional reaction 
results for CsHg and H, formation rates for two 
1.3 wt % Mo/ZSM-5 (Si/Al = 15 and 25) catalysts 
as a function of regeneration time (figs. S10 and 
S11) (17) show that both the overall activity and 
selectivity to CsHg fully recovered after regener- 
ation. Thus, rapid catalyst deactivation can be 
successfully addressed by regeneration with gas- 
phase O., and the catalyst lifetime can be ex- 
tended by repeated regeneration cycles. 
Correlations between the structure of the ini- 
tial Mo oxide species and catalytic performance 
can be established by comparing the evolution of 
the Raman spectra with changes in reaction rates 
as a function of regeneration time in figs. S10 and 
S11 (7). The catalytic activity was restored once 
Mo oxide nanostructures on double Al-atom 
framework sites were regenerated (after ~20 min). 
With increased regeneration time, these isolated 
Mo oxide species migrated from double to single 
Al-atom zeolite framework sites, and the catalytic 
performance with CH, remained unchanged. Fur- 
thermore, the catalytic performance of a regen- 
erated catalyst can be optimized and may exceed 
that of a fresh catalyst if the regeneration treat- 
ment is stopped before Mo oxide nanostructures 
are forced to migrate to Si anchoring sites on the 
external surface of the zeolite. Specifically for the 
1.3 wt % Mo/ZSM-5 (Si/Al = 25) catalyst, Mo oxide 
nanostructures were regenerated and moved from 
double to single Al-atom zeolite framework sites 
for regeneration times between 20 and 100 min 


Table 1. DFT-calculated vibrational modes for isolated Mo oxide species supported on ZSM-5 (cm): 
symmetric stretch (v,), asymmetric stretch (v,.), and bend (5). 


Mo species v, O=[Mo=O Vas O=Mo=O 5 O=Mo=O 
Mo(=O)2 on a double Al-atom site 992 972 388 
Mo(=O)20H on a single Al-atom site 975 962 356 


ies) 
in 


<— Regenerated 
for 100 min 


Regenerated 
for 120 min 


N N ue 
oe n —) 


Benzene formation rate, 1/s x 10+ 


in 


20. 40 60 = 80 
Time on stream, min 


Fig. 3. Catalytic activity of Mo nanostructures. (A) Benzene formation 
rates in methane conversion over a 1.3 wt % Mo/ZSM-5 (Si/AI = 25) catalyst. 
The catalytic activity declines with time on stream but is fully restored after 
120 min of gas-phase oxygen regeneration. Initial activity is enhanced by 
controlling the distribution of Mo nanostructures: Activity is higher after 
100 min of regeneration when Mo oxide nanostructures are anchored mostly 
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C-H distance: 0.11 nm 
Energy: 0 kJ/mol 


(fig. SILA) (17). Notably, Mo nanostructures re- 
mained anchored on zeolite framework Al sites 
when the regeneration was limited to this dura- 
tion, and the rates of CgsH, formation for such 
regenerated catalyst samples actually exceeded 
those for the fresh catalyst. The CsHg formation 
rates for a catalyst regenerated for 100 min in 
Fig. 3A exceeded those for the same catalyst be- 
fore deactivation during the initial time on stream 
period. In contrast, when the regeneration time 
was extended beyond 100 min, Mo oxide nano- 
structures were forced to migrate from Al frame- 
work sites to Si anchoring sites on the external 
surface of the zeolite. This change in the anchor- 
ing sites caused the catalytic activity to decrease 
to the level of the fresh catalyst, and the CgsH¢ 
formation rates for the catalyst regenerated for 
120 min (Fig. 3A) matched those for the fresh cat- 
alyst. With time on stream with CH,, the catalytic 
activity declined likely because of migration, growth, 
and coking of Mo NPs, and the performance for 
all regenerated catalysts eventually became in- 
distinguishable. However, in the first 60 min of 
time on stream, the benzene formation rates in 
Fig. 3A and fig. SI1C (17) were dependent on the 
identity of the initial Mo oxide nanostructures. 
For understanding these initial activity differ- 
ences, transition-state DFT calculations were used 
for comparing CH, activation over catalytic Mo 
carbide nanostructures anchored on the identi- 
fied three types of anchoring sites: double and 
single Al-atom zeolite framework sites and Si 
sites on the external surface of the zeolite. The 
calculations compared the first step (breaking 
of the CH3-H bond in methane), which is likely 
the rate-determining step, over the Mo carbide 
nanostructure with a stoichiometry of Mo,C, on 
the three zeolite anchoring sites (29). The CH, 
activation mechanism is similar for all anchoring 
sites, as illustrated in Fig. 3, which compares re- 
action pathways for a double Al-atom anchoring 
site inside a zeolite framework pore (Fig. 3, B, E, 
and F) and a Si site on the external surface of the 


on framework Al sites and not forced to migrate to Si sites on the external 
surface of the zeolite. (B to F) Reaction mechanism calculations show that 
the energy barrier for methane activation with the formation of CH3 and H 
species on the surface of the catalyst is lower when a Mo nanostructure is 
anchored on a framework Al site [(B), (E), and (F)] versus on a Si site on the 
external surface [(B) to (D)]. 
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zeolite (Fig. 3, B to D). The CH, initially approaches 
an exposed Mo atom, an atom that is not directly 
bonded to the zeolite. In the transition state (Fig. 3, 
Cand E), CH, forms a Mo-CH;-H-C cycle in which 
the C atom of CH, binds to the exposed Mo atom 
and, simultaneously, one of the H atoms of CH, 
binds to a C atom in the carbide. Thus, a Mo-C pair 
of atoms in the Mo carbide nanostructure serves 
as a single catalytic active site. This dual Mo-C site 
activates CH, in a scissoring motion that produces 
a CH; group bonded to Mo and an H atom bonded 
to C of the carbide (Fig. 3, D and F). Although the 
mechanism of CH, activation is similar, differences 
in geometries and electronic properties of Mo 
carbide nanostructures anchored on Al and Si 
sites lead to differences in their catalytic properties. 
The CH, activation energy over the Mo carbide 
anchored on the double Al-atom site of 112 kJ/mol 
in Fig. 3E is lower than 140 kJ/mol for the Si site 
in Fig. 3C. The transition state for the single Al- 
atom anchoring site is analogous to that for the 
double Al-atom site in Fig. 3E, with a comparable 
activation energy of 117 kJ/mol (table S6) (17). 
The CH, reaction is therefore predicted to be 
dominated by the activity of Mo nanostructures 
anchored on framework Al sites. This computa- 
tional result is consistent with known experimen- 
tal observations that the catalytic activity of Mo 
nanostructures depends strongly on the Si/Al 
ratio of the supporting zeolite and declines sub- 
stantially when Al framework sites are lost through 
dealumination (2, 8, 13-15, 30). 

The obtained information on the identity of 
Mo structures, their regeneration, and their in- 
fluence on catalytic activity opens new oppor- 
tunities for rational design of improved catalyst 
formulations and for optimizing reaction condi- 
tions for direct conversion of natural gas into 
liquid transportation fuels and valuable feed- 
stocks for the chemical industry. It is important 
to control the distribution of Mo oxide species 
and limit their anchoring to framework Al sites 
because initial Mo oxide nanostructures anchored 
on Al sites of the zeolite framework are converting 
into carbided Mo NPs with higher catalytic ac- 
tivity than those produced by initial Mo oxide 
species anchored on Si sites. The number and 
distribution of single and double Al-atom anchor- 
ing sites can be optimized by adjusting a zeolite 
synthesis procedure. The number of Si anchoring 
sites on the external surface of the zeolite can be 
reduced, or these Si sites can be eliminated com- 
pletely by adjusting the Mo deposition pro- 
cedure. Furthermore, the catalytic performance 
of Mo species and their periodic regeneration 
can be optimized by adjusting catalyst formula- 
tions (for example, with promoter metals) and 
changing the temperatures of the reaction and 
regeneration, flow rates, and other reaction 
conditions. 
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INORGANIC CHEMISTRY 


A synthetic Mn,Ca-cluster mimicking 
the oxygen-evolving center 


of photosynthesis 


Chunxi Zhang,'* Changhui Chen,” Hongxing Dong,”* Jian-Ren Shen,” 


Holger Dau,** Jingquan Zhao’ 


Photosynthetic splitting of water into oxygen by plants, algae, and cyanobacteria is 
catalyzed by the oxygen-evolving center (OEC). Synthetic mimics of the OEC, which is 
composed of an asymmetric manganese-calcium-oxygen cluster bound to protein groups, 
may promote insight into the structural and chemical determinants of biological water 
oxidation and lead to development of superior catalysts for artificial photosynthesis. We 
synthesized a Mn,Ca-cluster similar to the native OEC in both the metal-oxygen core and 
the binding protein groups. Like the native OEC, the synthetic cluster can undergo four 
redox transitions and shows two magnetic resonance signals assignable to redox and 
structural isomerism. Comparison with previously synthesized Mn3CaO,-cubane clusters 
suggests that the fourth Mn ion determines redox potentials and magnetic properties of 


the native OEC. 


he oxygen-evolving center (OEC) in photo- 
system IT (PSII) of plants, algae, and cyano- 
bacteria facilitates splitting of water into 
Os, protons, and electrons (J-4). Crystallo- 
graphic structures (5-8) reveal that the core 
of the OEC consists of a Mn3CaO, cubane motif 
and a “dangler” Mn linked via two bridging ox- 
ides, forming a distinct asymmetric Mn,Ca-cluster 
(Fig. 1A). This cluster is coordinated to four water 


molecules, one imidazole, and six carboxylate 
groups of the amino acid residues of the PSII 
polypeptides (Fig. 1C). The structure of the OEC 
as well as the oxidation states of the four manga- 
nese ions undergo changes during the water- 
oxidation reaction cycle, or S-state cycle (4, 9, 10). 
Spectroscopic results and computational chem- 
istry have provided insight in reaction interme- 
diates and mechanisms (4, 9-16). The lability of 
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the protein-bound OEC prevents electrochemical 
characterization and controlled modification of 
its structure and constituents. Only molecular 
mimics can facilitate direct measurement and 
major chemical variation of redox potentials or 
proton binding energies (pK, values); alternative 
mechanistic pathways can be explored, and spec- 
troscopic signatures of the native OEC and com- 
putational approaches can be “calibrated.” 

Numerous multi-manganese complexes have 
been synthesized as possible mimics of the OEC 
(17-22), some of which contained a Mn,CaO, clus- 
ter resembling closely the cubane part of the 
native OEC (19-21). Investigation of chemical 
variants proved useful for unravelling determi- 
nants of Mn oxidation potentials. Replacement of 
the calcium of the Mn,CaO, cubane increased 
the redox potential of the Mn'’,/Mn'’,.Mn"™ couple 
by more than 1 V (Ca”* = Sr** < Zn** < Y** < Sc?" < 
Mn**) (22). Calcium (20) or silver ions (21) were 
incorporated in the Mn4 dangling position, but 
mimicking the natural paragon by proper coor- 
dination of a fourth Mn ion was never attained. 

We synthesized complex I, [Mn,Ca0,(Bu'CO,), 
(Bu'CO.H).(py)] (Bu, tert-butyl; py, pyridine), at 
a high yield (~50%) and starting from inexpen- 
sive commercial chemicals. Briefly, a precursor of 
I was synthesized by a reaction of Bu"4NMnO, 
(Bu", n-butyl), Mn(CH3CO.)2:(H»2O),4, and Ca 
(CH3COy):H2O (molar ratio of 4: 1: 1) in boiling 
acetonitrile in the presence of an excess of pivalic 
acid. Complex I was obtained after recrystalliza- 
tion of the precursor in the presence of 2% pyridine 
in ethyl acetate (Fig. 1E). Crystallographic char- 
acterization (Fig. 1, B and D) reveals one Mn3CaO. 
cubane linked to a dangler Mn via O5, which is 
bridging between four metal ions (thus denoted 
as [4-O). This is in line with the influential first 
OEC model deduced from crystallographic data 
at 3.5 A resolution (6) and related mechanistic 
proposals (23). The asymmetric Mn,CaO,-cluster 
of I is similar to the native OEC regarding both 
structure of the Mn,Ca-oxo core and the ligating 
carboxylate groups (table S5). The integration of 
the Ca ion into the Mn3CaO, cubane as well as its 
anchoring by three carboxylates bridging to Mn1, 
Mn2, and Mn4 is identical, as is a Mn-carboxylate 
chain topology (Mn1-OCO-Mn2-OCO-Mn3-OCO- 
Mn4). In the synthetic complex, all metal ions 
that are pairwise connected by a bridging oxygen 
are also connected by a bridging carboxylate. The 
same design principle applies to the biological 
complex, with the notable exception of the Mn1- 
Mn4 connectivity. There are further differences: 
The [19-04 atom of the native OEC is replaced by 
a bridging carboxylate group, and the terminally 
coordinated water molecules are replaced by more 
complex groups. 
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Bond-valence sum (BVS) analysis of I suggests 
oxidation states of Mni?*, Mn2**, Mn3**, and Mn4?* 
(Fig. 1B and table S2), in complete analogy to the 
OEC in its S, state (Fig. 1A) (8, 10). The Mn—Mn 
distances of ~2.75 A (table S3) match x-ray ab- 
sorption fine-structure (XAFS) (4, 10) and recent 
x-ray free electron laser (XFEL) (8) results on the 
§,-state OEC. Because of its close structural sim- 
ilarity, our synthetic model supports that the 
generally shorter interatomic distances found in 
the recent XFEL study (8) [shorter than in (7)] are 
chemically reasonable for a CaMn™,,Mn'’,0.,N, 
complex (21 first-sphere O-ligands in the native 
OEC, including O-Glujg9, versus 22 in I). 

One informative difference between I and the 
native OEC is the coordination of O05, the 4-0 
atom bridging between three Mn and a single Ca 
ion, likely playing a decisive role in photosynthe- 
tic water oxidation (7, 8, 12, 14, 15, 24, 25). The 
Mn3/4-05 bond lengths (1.85/1.85 A) are shorter 
in I than in the native OEC (2.2/2.3 A). Even 
more remarkable, in the native OEC the Mn1-O5 


Natural catalyst 


distance is so long (2.7 A) that Mn1 becomes ef- 
fectively five-coordinated, with an open coordi- 
nation site that may be essential for coordination 
of a “substrate” water molecule upon oxidation 
of Mn4 from the +3 to the +4 state (8, 14, 15). 
Comparison of I and the OEC clarifies the origin 
of this prominent structural difference: In the 
synthetic complex, Mn1 and Mné4 are clamped 
together by a single bridging carboxylate, whereas 
in the OEC, they are spaced by ligation of two 
neighboring protein residues, His*”? and Glu®”’, 
to Mn, Mn3 and Mn4. Only the His*”*-Glu?™ dyad 
breaks the rule that each pair of -oxo-bridged 
metal ions also is u-carboxylato-bridged. We there- 
fore propose that in the protein, the absence of 
u-carboxylato-bridging between Mn1 and Mn4, 
possibly supported by a spacing effect of the His®”*- 
Glu? dyad, allows for formation of the function- 
ally crucial open coordination site at the Mn1 ion. 

The cyclic voltammogram (CV) of I dissolved 
in 1,2-dichloroethane (Fig. 2) shows that five re- 
dox states are accessible. Thus, the synthetic 


Artificial complex 


Reaction Freezing 


—r> 
CH;CN 


° 
: Precursor : 


(brown crystal): 
Rien oe : 


/ (boiling) weveces 


+ Reerystallization 


ethyl! acetate 
2% pyridine 


Complex I 


Fig. 1. Crystal structures of the native OEC and the synthetic Mn,Ca complex I. (A) Mn,CaOs core 
of the native OEC (8). (B) Mn4CaO, core of I. (©) Structure of the native OEC, including ligating protein 
side-chains and water molecules. (D) Structure of I, including all ligand groups. (E) Synthesis of I. 
Distances are given in angstroms; Mn, Ca, O, N, and C are shown in purple, green, orange, blue, and yellow, 
respectively. For clarity, all H atoms and pivalic CH3 groups are omitted. 
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complex can accumulate oxidation equivalents in 
close analogy to the native OEC (4, 9, 10). Although 
the native OEC is embedded in a protein envi- 
ronment that is closely mimicked in complex I, 
there are also water molecules close to the native 
metal-oxo core, which might render direct com- 
parison of redox potentials problematic. Yet, the 
voltammograms of I in 1,2-dichloroethane and 
acetonitrile are similar (fig. S8), suggesting that 
the redox potentials do not depend strongly on 
solvent polarity. Addition of trace amounts of 
water (< 1%) to I in acetonitrile could modify 
the complex through ligand exchange (fig. S12) 
and results in obscured CV oxidation and reduc- 
tion waves, possibly relating to a superimposed 
catalytic wave (figs. S9 and S10). 

The as-synthesized complex is in the S,-state. 
The S,—S, transition of I is detected at a redox 
potential [~0.8 V versus normal hydrogen elec- 
trode (NHE)], which is close to the estimated po- 
tential of the corresponding OEC redox transition 
(= 0.9 V) (3, 26) but is remarkably different from 


Fig. 2. S-state cycle of the native OEC and CV 
of I. Potentials versus NHE; scan rate of 100 mV/s. 
Native OEC is in green, peak positions of CV waves 
are in red, and estimated midpoint potentials are in 
blue. The likely oxidation states of the four Mn ions 
in various S-states of the native OEC and plausible 
oxidation states of I are indicated with black num- 
bers. A CV of lin acetonitrile and further details are 
provided in fig. S8. 
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that of the previously Mn,CaO, complex without 
a dangling Mn ion, indicating that the fourth Mn 
ion could play a crucial role in tuning the redox 
potential of the Mn,Ca-cluster (9-21). Before and 
after dissolution in organic solvent, complex I 
exhibits a broad parallel-mode electron para- 
magnetic resonance (EPR) signal (fig. $3), which 
resembles the g value and width of the S,-state 
signal of the native OEC. 

To trap the S, state of I, we used [Fe(phen)s3] 
(PF¢)3 (phen, 1,10-phenanthroline) as a chemical 
oxidant with a redox potential of +1.1 V versus 
NHE. Two S,-state signals are observed in the 
perpendicular-mode EPR spectrum (Fig. 3): (i) a 
multiline EPR signal centered at g value of 2.0, 
with a width of ~1600 G and more than 20 hy- 
perfine peaks, and (ii) a signal centered at a g 
value greater than 4, with a width of ~500 G and 
without resolved hyperfine structure. The two EPR 
signals observed are similar to the well-known g = 
2.0 and g = 4.1 EPR signals, both observed in the S.- 
state of the native OEC (9, 27, 28). Both EPR signals 


O, 


of I exhibit a linear Curie-plot behavior (fig. S6), 
suggesting that both are ground-state signals, in 
full analogy to the native OEC, in which the two 
EPR signals and their respective spin states (g = 2 
signal, S = 1/2; g = 4.1/4.9 signals, S = 3/2) have 
been proposed to arise from two OEC conformers 
differing in the localization of the Mn** ion in the 
S.-state (Mni?*Mn4"* versus Mn1**Mn4**) (28). 
The possibility of structural conformers poten- 
tially affecting the exchange coupling between 
Mn ions is suggested by subtle structural dif- 
ference between the two monomers resolved in 
the crystallographic unit cell of I. Specifically, the 
nonidentical Mni-Mné4 and Ca-O5 distances (ta- 
ble S4) may be of relevance. These differences 
were determined for the crystallographically char- 
acterized S,-state cluster. At present, it is still open 
whether and how these differences are amplified 
for the dissolved complex or through Mn oxidation 
in the S,—S, transition. A similarly close analogy 
of the magnetic properties has never been ob- 
served for cubane-type Mn3,CaO, complexes 


4Mn 4Mn 4Mn 
+3,4+3} [43,43]  |43,43 
+3,43| [43,44] [44,44 
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(19-21) nor for any other multinuclear Mn com- 
plex. The observation of a low-spin multiline sig- 
nal in I contrasts the high-spin state of Mn3CaO, 
cubane complexes (20, 29), supporting that the 
fourth Mn ion is crucial regarding the magnetic 
properties of the heterobimetallic complexes. Com- 
plex I and future variants could become valuable 
model systems in experimental and theoretical 
investigations aiming at insight in the magnetic 
properties of the biological complex, in which 
the spin localization has been suggested to be 
of relevance in the O-O bond formation chem- 
istry (14). 

Each of the four redox transition of I increases 
the potential for the subsequent oxidation step— 
for example, from 0.8 V (S;—Sz) to 1.25 V (Ss—Ss3) 
(Fig. 2). In the native OEC, the S,—S, transition is 
not coupled to a charge-compensating proton re- 
lease (11), and its redox potential increases (10) to 
an unknown level. On the basis of the general sim- 
ilarity of the S,—S, transition in I and the native 
OEC, we propose that in both, a potential increase 
of similar magnitude may occur. This potential in- 
crease in the S,—S, transition can explain why in 
PSII, a charge-compensating and redox-potential- 
lowering deprotonation of the OEC—likely coupled 


Fig. 3. EPR spectrum of 
lin its Mn'Y;Mn!"' state 
(S2-state) at 7K. The 
broad multiline signal 
(>20 lines) centered close 
to g = 2 as well as EPR 
signals at higher g values 
(g > 4) are assignable to 
the Mn complex in differ- 
ent spin states. For clarity, 
the signal at g = ~3 range 
from [Fe(phen)3]°* (figs. 
S4 and S7) was replaced 
with a dashed line. 
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to further chemical changes—precedes Mn oxi- 
dation in the S.—S, (73) and S3—S, transition 
(30). In the native OEC, redox leveling through 
deprotonation reduces the overpotential need for 
accumulation of oxidizing equivalents (J, 70). In 
the absence of efficient redox leveling, the over- 
potential need would be prohibitively high, as 
shown by the behavior of I in organic solvents 
(Fig. 2 and fig. S8). 

Although I is similar in many ways to native 
OEC, it still deviates in several functionally cru- 
cial aspects, including the water-binding sites at 
Mn4 and the Ca ion. We consider the synthesis of 
T as a first step for synthesis of further variants 
that mimic native OEC even closer. The synthetic 
flexibility is illustrated by related complexes II 
and III (Fig. 4), which have terminal ligands 
at the Ca ion or at Mn4 that are exchanged by 
alternative neutral ligands. In the native OEC, 
these terminal ligand sites are occupied by water 
molecules. The binding sites of the readily ex- 
changeable neutral ligands on Ca and the dan- 
gler Mn in the described synthetic complexes 
may serve as potential positions to create water 
binding sites that resemble closely the respective 
sites of the native OEC. Last, the complexes de- 
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Fig. 4. Structures of complexes with exchanged terminal ligands. The complexes differ from I by 
ligand exchange either on (A) Ca (II) or (B) Mn4 (III. 
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scribed here can serve as references for calcu- 
lation of vibrational spectra (fig. S13) and absolute 
OEC redox potentials; the latter is pressingly 
needed for verification of computational models 
(12, 14). 
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VACCINES 


Long-term measles-induced 
immunomodulation increases overall 
childhood infectious disease mortality 


Michael J. Mina,”?* C. Jessica E. Metcalf,”? Rik L. de Swart,* 


A. D. M. E. Osterhaus,* Bryan T. Grenfell””* 


Immunosuppression after measles is known to predispose people to opportunistic 
infections for a period of several weeks to months. Using population-level data, we show 
that measles has a more prolonged effect on host resistance, extending over 2 to 3 years. 
We find that nonmeasles infectious disease mortality in high-income countries is tightly 
coupled to measles incidence at this lag, in both the pre- and post-vaccine eras. We 
conclude that long-term immunologic sequelae of measles drive interannual fluctuations in 
nonmeasles deaths. This is consistent with recent experimental work that attributes the 
immunosuppressive effects of measles to depletion of B and T lymphocytes. Our data 
provide an explanation for the long-term benefits of measles vaccination in preventing 
all-cause infectious disease. By preventing measles-associated immune memory loss, 
vaccination protects polymicrobial herd immunity. 


easles vaccines were introduced 50 years 

ago and were followed by striking reduc- 

tions in child morbidity and mortality 

(/, 2). Measles control is now recognized 

as one of the most successful public health 
interventions ever undertaken (3). Despite this, 
in many countries vaccination targets remain 
unmet, and measles continues to take hundreds 
of thousands of lives each year (3). Even where 
control has been successful, vaccine hesitancy 
threatens the gains that have been made (J, 4). 
The introduction of mass measles vaccination 
has reduced childhood mortality by 30 to 50% 
in resource-poor countries (5-8) and by up to 
90% in the most impoverished populations (9, 10). 
The observed benefits cannot be explained by the 
prevention of primary measles virus (MV) infec- 
tions alone (JJ, 12), and they remain a central mys- 
tery (13). 

MV infection is typified by a profound, but 
generally assumed to be transient, immunosup- 
pression that renders hosts more susceptible to 
other pathogens (14-17). Thus, contemporaneous 
reductions in nonmeasles mortality after vacci- 
nation are expected. However, reductions in in- 
fectious disease mortality after measles vaccination 
can last throughout the first 5 years of life (5-0), 
which is much longer than anticipated by tran- 
sient immunosuppression, which is generally con- 
sidered to last for weeks to months (J6, 17). 

Proposed mechanisms for a nonspecific bene- 
ficial effect of measles vaccination range from 
suggestions that live vaccines may directly stimu- 
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late cross-reactive T cell responses or that they 
may train innate immunity to take on memory- 
like phenotypes (23, 18-21). Although well described 
by Aaby (11, 12) and others (22) in observational 
studies, primarily in low-resource settings, these 
effects may not fully explain the long-term ben- 
efits observed with measles vaccination and can- 
not explain the pre-vaccination associations of 
measles and infectious disease mortality we 
describe below. The World Health Organization 
(WHO) recently addressed this issue (22) and 
concluded that measles vaccination is associated 
with large reductions in all-cause childhood mor- 
tality but that there is no firm evidence to explain 
an immunological mechanism for the nonspe- 
cific vaccine benefits. 

Recent work (17, 23) invoked a different hypo- 
thesis that a loss of immune memory cells after MV 
infection resets previously acquired immunity, 
and vaccination prevents this effect. de Vries et al. 
(17) reproduced transient measles immune sup- 
pression in macaques, characterized by systemic 
depletion of lymphocytes and reduced innate im- 
mune cell proliferation (24). Although peripheral 
blood lymphocyte counts were restored within 
weeks as expected (25), the authors hypothesized 
that rapid expansions of predominantly measles- 
specific B and T lymphocytes masked an ablated 
memory-cell population (17). In other words, MV 
infection replaced the previous memory cell rep- 
ertoire with MV-specific lymphocytes, resulting 
in “immune amnesia” (77) to nonmeasles pathogens. 
Previous investigations of virus-induced memory- 
cell depletion suggest that recovery requires re- 
stimulation, either directly or via cross-reactive 
antigens (26-29). 

We propose that, if loss of acquired immuno- 
logical memory after measles exists, the resulting 
impaired host resistance should be detectable in 
the epidemiological data collected during periods 
when measles was common and [in contrast to 


previous investigations that focus on low-resource 
settings (5-12)] should be apparent in high-resource 
settings where mortality from opportunistic in- 
fections during acute measles immune suppres- 
sion was low. Relatively few countries report the 
necessary parallel measles and mortality time se- 
ries to test this hypothesis. National-level data 
from England, Wales, the United States, and 
Denmark [Fig. 1, A to C; see supplementary 
materials (SM) 1 for details], spanning the dec- 
ades surrounding the introduction of mass mea- 
sles vaccination campaigns, meet our data criteria. 
To assess the underlying immunological hypo- 
thesis (Fig. 1D) using population-level data, we 
required that first, nonmeasles mortality should 
be correlated with measles incidence data, espe- 
cially because the onset of vaccination reduces 
the latter. Second, an immune memory loss mecha- 
nism should present as a strengthening of this 
association when measles incidence data are trans- 
formed to reflect an accumulation of previous 
measles cases (a measles “shadow”). For example, 
if immune memory loss (or more broadly, immu- 
nomodulation) lasts 3 years, the total number 
of immunomodulated individuals (S) in the nth 
quarter can be calculated as the sum of the mea- 
sles cases (M) over the previous (and current) 
12 quarters: Sy = My + Mn-t0 + «..+ Mn+ + Mn. 
In practice, we weighted the quarters using a 
gamma function. Dividing S by the total popula- 
tion of interest thus provides the prevalence of 
immunomodulation (see SM 2, and 3; fig. S1, Ato 
C; and movie S1 for detailed methods). Third, the 
strength of this association should be greatest 
when the mean duration over which the cases are 
accumulated matches the mean duration required 
to restore immunological memory after MV infec- 
tion. Fourth, the estimated duration should be 
consistent both with the available evidence of 
increased risk of mortality after MV, compared 
with uninfected children, and with the time 
required to build a protective immune repertoire 
in early life (Fig. 1D, fig. S2, and SM 5 and 6). 
To explicitly address whether the observed 
nonspecific benefits of vaccination can be attri- 
buted to the prevention of MV immunomodula- 
tion, evidence for the four hypotheses must be 
present separately within the pre-vaccine eras. 
Reductions in nonmeasles infectious disease 
mortality (SM 1) are shown in Fig. 1, for children 
aged 1 to 9 years in Europe and aged 1 to 14 years 
in the United States, shortly after the onset of 
mass vaccination in each country. The fall in mor- 
tality was later in Denmark, corresponding to the 
introduction of measles vaccination in the 1980s, 
as compared to the late 1960s for the United 
Kingdom and United States. In all locations, 
measles incidence showed significant (P < 0.001) 
associations with mortality (Fig. 1, E to G). How- 
ever, effect sizes varied (fig. S3A), reflecting low 
reporting in the United States [fig. S3B and (30)] 
and changes in health care practice between eras. 
Adjusting for year as a covariate (SM 4) had little 
effect on the point estimates (fig. S3C). These as- 
sociations could reflect transient measles immune 
suppression. Thus, to address our second hypo- 
thesis that MV immunomodulation can explain 
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long-term increased mortality and, consequently, 
improved survival after vaccination, we trans- 
formed measles incidence into population preva- 
lence of MV immunomodulation, with the duration 
of the immunomodulation (the time required to 
rebuild sufficiently protective immune memory) 
defined by a gamma distribution to weight the 
previous time points summed together, as dis- 
cussed above and in SM 2 and movie S1. 
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When the gamma-distributed transformation 
was applied to the England and Wales measles 
data, the best-fit duration of MV immunomodu- 
lation, as determined by the linear fit of the cor- 
responding prevalence of MV immunomodulation 
to the mortality data (SM 3), centered at a 28.3- 
month duration of measles-induced immuno- 
modulation. This corresponded to a strong and 
significantly improved association between mea- 
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sles and all-cause infectious disease mortality 
[coefficient of determination (R?) = 0.92 versus 
0.37; P < 0.00001; Fig. 2, A to C]. Simpler addi- 
tive transformations (SM 2 and movie S1) return 
the same qualitative pattern (Fig. 2D), and adjust- 
ing for year in the model had no effect (fig. S4). 
Figure 2, E and F, shows the time series for the 
actual and predicted mortalities (calculated from 
the linear fits), together with the prevalence of 
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Fig. 1. Measles incidence, nonmeasles infectious disease mortality, and 
measles-induced immunomodulation. Nonmeasles infectious disease mortal- 
ity and measles incidence time series (A to C) and regressions (E to G) are shown 
for England and Wales, the United States, and Denmark. The vertical dashed lines in 
(A) to (C) indicate the year of introduction of the measles vaccine. (D) Measles- 
induced lymphopenia and subsequent measles-specific lymphocyte expansion in 
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the weeks after MV infection, as described in (17 23), are shown, and time is 
extended to depict hypothesized long-term immunomodulatory effects of MV 
infection and reconstitution of the immune response through individual exposures. 
Scatter plots and best-fit regression curves (plotted with 95% confidence bands) 
are shown for nonmeasles infectious disease mortality versus measles incidence 
for England and Wales (E), the United States (F), and Denmark (G). 


8 MAY 2015 * VOL 348 ISSUE 6235 695 


RESEARCH | REPORTS 


MV immunomodulation (see movie S2 for the full 
paths of these transformations). Overall, a simple 
weighted integral of measles incidence captures 
the nonmeasles mortality data remarkably well 
and more closely than the raw incidence. 

To evaluate our fourth hypothesis, we com- 
pared the best-fit gamma distribution (indicating 
average time to return of full immunity) to (i) the 
previously observed (37) duration of elevated rel- 
ative risk (RR) of all-cause mortality after inten- 
sive measles exposure (albeit in children exposed 
before 6 months of age) and (ii) to the global age 
distribution (fig. S2) of bacterial invasive disease 


in children under 5 years of age [an estimate for 
the time required to build protective immunity 
(32)]. Both the elevated RR of mortality after 
measles exposure and declines from peak rates 
of bacterial disease, indicating the development 
of sufficiently protective immunity, fell precisely 
along the gamma curve (R ? =0.97 and 0.99, re- 
spectively; Fig. 2G). 

We tested these results for nonspecific effects 
of vaccination (11) by applying the same proce- 
dures to the pre-vaccination data alone. Before 
vaccination, the best-fit duration of MV immu- 
nomodulation was no different (centered at 28.0 


versus 28.3 months; Fig. 2, B to D, and fig. S5A) 
and closely matched the duration identified in 
the post-vaccine era as well (29.2 months; fig. 
S5A). Moreover, the coefficients describing the 
slope of mortality rate versus prevalence of immu- 
nomodulation were nearly identical (Fig. 2, B and 
I, and fig. S5B). This effect held regardless of 
which era’s respective best-fit gamma transform 
was used (fig. S5, B to D), because optimizing the 
transformation using only the pre-vaccine data 
permitted accurate prediction of the post-vaccine 
era data that was not used to fit the model 
(known as out-of-sample prediction). The results 
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Fig. 2. England and Wales: Measles-induced immunomodulation and 
nonmeasles infectious disease mortality (1952-1975). Annual incidence 
of nonmeasles infectious disease mortality regressed against the prevalence of 
MV immunomodulation, given (A) no transform or (B) the best-fit gamma 
transform (that provides the best linear fit, R®, to the data). Individual regres- 
sion lines and 95% confidence intervals are plotted for regressions over the full 
data set (blue), the pre-vaccine era data only (green), and the post-vaccine era 
data only (red). R? is plotted against the mean duration of MV immunomo- 
dulation for the (©) gamma or (D) additive transformation for the full data set 
(blue lines) or the pre-vaccine data only (red lines). Inset graphs in (A) and (B) 
are the same as (C), and the location of the dots (color coded as per the 
regression lines) represent the duration of immunomodulation and the R? 
values associated with the scatter plot shown. In (E and F), the measured 
nonmeasles infectious disease mortality is plotted (broken line) along with the 
predicted annual mortalities (solid blue and red lines), predicted using the 
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B (x 10%; 95% Cl) B (x 10-4; 95% Cl) 
regression coefficients from the (H) untransformed or (I) best-fit transformed 
data. Predictions in (E) and (F) are either in sample and based on the full data 
set (blue line), or out of sample and based on the pre-vaccine data only (red 
ine). In (F), for example, the in-sample mortality prediction is made from the 
regression coefficients (I) from the best-fit transformed data for the full data 
set (blue line), and mortality is also predicted entirely out of sample for the 
post-vaccine era by optimizing the gamma transformation and calculating 
regression coefficients using only data from the pre-vaccine era (red line; fig. 
S5C). (G) The best-fit gamma distribution (optimized against the full data set) 
is shown (dark red line), along with the distribution of under-5 bacterial invasive 
disease versus age after the depletion of maternal antibodies (broken gray 
ine), and the relative risk of non-MV mortality after MV infection, described in 
(31), is also shown plotted against time since MV infection (blue points). (H) 
and (I) The regression coefficients for the best-fit lines shown in (A) and (B) are 
plotted with 95% confidence intervals in (H) and (I), respectively. 
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are robust to the chosen focal age groups (1 to 4 
and 5 to 9 years old age groups shown in fig. S6) 
and are sensitive to the specific order and mag- 
nitude of the measles epidemics, because ran- 
domizing the time series by year erased the 
above effects (fig. S7). 

Furthermore, results were robust to individual 
disease classes (table S1), with best-fit durations 
of immunomodulation predisposing to individ- 
ual classes of infectious disease mortality lasting 
between 18 and 30 months (mean, 27 months; 
median, 24 months). Exceptions were rubella 
[although rubella was not included in the prima- 
ry analysis (SM 1)] and septicemia, which had 
best-fit durations of immune memory loss of ap- 
proximately 12 and 3 months, respectively. 

Many previous investigations described stron- 
ger nonspecific benefits of MV vaccination in fe- 
males than males (7, 11, 13); thus, as an additional 
test of our hypotheses, we also compared genders. 
In agreement, we found consistently stronger 
associations among females (fig. S8). To test the 
England and Wales findings, we applied the full 
analysis to data from the United States (Fig. 3 and 
movie S3). Here, the optimized gamma trans- 
formation centered at 30.9 months (versus 28.3 


for the United Kingdom) again greatly improved 
the fit of the data (R? = 0.88 versus 0.42; Fig. 3, A 
to F), showed very strong agreement with pre- 
viously observed durations of mortality risk after 
intensive measles exposure (R? = 0.999, Fig. 3G), 
and was predicted by the age distribution of in- 
vasive bacterial infections in children under 5 
(R° = 0.921; Fig. 3G). 

We tested whether declines in mortality after 
vaccination were caused by nonspecific benefits 
of vaccination. Transformation of the data from 
the pre-vaccine era increased the strength of the 
association markedly (R? = 0.87 versus 0.07). As 
in the United Kingdom, very similar best-fit gam- 
ma transformations were obtained before (and 
after) vaccination (centered at 30.3 and 29.2 months; 
Fig. 3C and fig. S9A); the coefficients of associ- 
ation (with mortality) across vaccine eras were 
again consistent (Fig. 3, B and I, and fig. S9); the 
results were robust across age groups (fig. S10); 
the strengths of association were stronger in 
females (fig. S11); and adjusting for year had no 
effect (fig. S12). 

The data from Denmark (SM 1) were recorded 
at yearly rather than quarterly intervals and lim- 
ited the scope of analyses. Furthermore, because 


measles vaccine introduction in Denmark in 1987 
occurred as data became available and was so 
successful that within its first year, measles in- 
cidence was reduced by an order of magnitude 
(Fig. 1C), we were compelled to focus on the two 
decades that followed the logarithmic reduction 
of cases (1990-2010). Despite these limitations, 
transformation of the yearly data (using eqs. 3 
and 8 in SM 2) led to steep peaks in model fit at 
30 months duration of MV immunomodulation 
that predicted mortality (R? = 0.77; Fig. 4A). 
When we inferred quarterly incidence data from 
the annualized data [using monthly measles 
incidence data from a large population sample 
in Denmark (33) to estimate proportions of 
annual cases occurring each quarter; SM 1], the 
best fit indicated a 26.4-month duration of MV 
immunomodulation (Fig. 4, B and E), agreeing 
with the durations defined for England and Wales 
and the United States above, as well as the pre- 
viously observed risk of mortality after measles 
(Fig. 4C). 

These results provide population evidence for 
a generalized prolonged (roughly 2- to 3-year) 
impact of measles infection on subsequent mor- 


tality from other infectious diseases. Fluctuations 
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Fig. 3. USA: Measles-induced immunomodulation and nonmeasles infectious disease mortality (1943-1970). (A to 1) Plots are as described for 
England and Wales in Fig. 2. 
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in childhood mortality in the United Kingdom, 
the United States, and Denmark are explained by 
a simple weighted integral that describes the 
prevalence of measles immune memory loss and 
thus captures the impact of measles infection and 
immune depletion. We anticipate that morbidity 
data might show stronger effects. 

As a further test of the immunosuppressive 
impact of measles, we carried out a similar analy- 
sis on pertussis. Pertussis is a vaccine-preventable 
disease that is not known to be immunosup- 
pressive and for which high-quality weekly data 
(34) are available for England and Wales that 
were collected during the pre-measles vaccine 
years described above. We found no correlation 
(fig. S13) between pertussis incidence and non- 
pertussis infectious disease mortality. No corre- 
lation was observed even when the pertussis data 
were transformed to reflect the sum of previous 
pertussis cases (a pertussis “shadow”) extending 
over 48 months (fig. S13). 

Our results show that when measles was com- 
mon, MV infections could have been implicated 
in as many as half of all childhood deaths from 
infectious disease, thus accounting for nearly all 
of the interannual fluctuations in childhood in- 
fectious disease deaths. The reduction of MV in- 


Fig. 4. Denmark: Measles- 
induced immunomodulation 
and non-MV infectious dis- 
ease mortality (1990-2010). 
R? versus duration of measles- 
induced immunomodulation 
when measles data are 
transformed using (A) the addi- 
tive transformation with yearly 
intervals or (B) the gamma 
transform with quarterly inter- 
vals, where yearly measles inci- 
dence was converted to 
quarterly incidence based on 
(33). The best-fit gamma 
transform (C), as well as the 6 
predicted nonmeasles mortality, 

predicted from the untransformed 

(D) or the best-fit gamma 

transform (E) MV immuno- 

modulation data, are shown and 

are described for the respective 

figures in Figs. 2 and 3. 
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fections was the main factor in reducing overall 
childhood infectious disease mortality after the 
introduction of vaccination. 

Consistency in the best-fit duration of MV 
immunomodulation (the time required to re- 
store protective immune memory) in all three 
countries, the close fit to observed durations of 
increased mortality after intensive MV expo- 
sure, and its correspondence to the early de- 
velopment of immunity (before 5 years of age) 
through exposure all provide strong evidence 
for a measles immune effect. The similarity in 
results obtained for both pre- and post-vaccine 
eras, the qualitative consistency across ages, and 
the stronger associations in females (7) provide 
further support for an underlying immunomod- 
ulatory mechanism. Finally, these results are con- 
sistent with multiple immuno-epidemiologic and 
case-controlled studies that show reduced or ab- 
sent antigen-specific cellular responses lasting 
3 years after measles (35, 36) and reduced atopy 
even 15 years after infection (37). 

The correspondence between our results and 
previous epidemiological data by Aaby et al. (8D 
should be viewed with the caveat that the in- 
creased relative risk of mortality after intensive 
measles exposure was measured in children ex- 
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posed before 6 months of age, not all of whom 
developed features of clinical measles infections. 
Other studies (12, 38, 39) have failed to detect 
long-term immunologic sequelae of measles. These 
previous cohort studies have focused on low- 
income countries, primarily in West Africa, where 
very high rates of death from opportunistic in- 
fections during acute measles immune suppres- 
sion drive mortality dynamics and mask the 
pernicious long-term immunological effects of 
measles infection. For example, approximately 
50% of all childhood deaths recorded over 5 years 
of follow-up occurred within only 2 months of 
measles infection, precluding the detection of 
long-term sequelae in those children. 

MV infection and vaccination produce strong 
and durable herd immunity against subsequent 
epidemics (40). Our results thus suggest an extra 
dynamical twist: MV infections could also reduce 
population immunity against other infections in 
which MV immunomodulation could be envisioned 
as a measles-induced immune amnesia (17); hence, 
measles vaccination might also be preserving herd 
protection against nonmeasles infections. 

Measles vaccination is one of the most cost- 
effective interventions for global health, and our 
results imply further immunological dividends: 
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mortality (and probably morbidity) reductions 
linked to measles vaccination might be much 
greater than previously considered. This is of 
particular importance today where, especially 
in wealthy nations, reduced opportunistic infec- 
tions during acute measles immunosuppression, 
added to the comparative rarity of infection, has 
led to a public view of measles as a benign child- 
hood disease. Our findings help dispel the mys- 
tery surrounding the disproportionately large 
reductions in mortality seen after the introduction 
of measles vaccinations and reinforce the impor- 
tance of measles vaccination in a global context. 
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CHROMOSOMES 


CENP-C reshapes and stabilizes 
CENP-A nucleosomes at 


the centromere 
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Inheritance of each chromosome depends upon its centromere. A histone H3 variant, 
centromere protein A (CENP-A), is essential for epigenetically marking centromere 
location. We find that CENP-A is quantitatively retained at the centromere upon which it is 
initially assembled. CENP-C binds to CENP-A nucleosomes and is a prime candidate to 
stabilize centromeric chromatin. Using purified components, we find that CENP-C reshapes 
the octameric histone core of CENP-A nucleosomes, rigidifies both surface and internal 
nucleosome structure, and modulates terminal DNA to match the loose wrap that is found 
on native CENP-A nucleosomes at functional human centromeres. Thus, CENP-C affects 
nucleosome shape and dynamics in a manner analogous to allosteric regulation of 
enzymes. CENP-C depletion leads to rapid removal of CENP-A from centromeres, 
indicating their collaboration in maintaining centromere identity. 


entromeres direct chromosome inheritance 
at cell division, and nucleosomes containing 
a histone H3 variant, centromere protein A 
(CENP-A), are central to current models of 
an epigenetic program for specifying cen- 
tromere location (J). The centromere inheritance 
model in metazoans suggests that the high local 
concentration of preexisting CENP-A nucleosomes 
at the centromere guides the assembly of nascent 
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CENP-A, which occurs once per cell cycle after 
mitotic exit. This model predicts that after initial 
assembly into centromeric chromatin, CENP-A 
must be stably retained at that centromere; other- 
wise, centromere identity would be lost before 
the next opportunity for new loading in the next 
cell cycle. Here, we use biochemical reconstitu- 
tion to measure the shape and physical proper- 
ties of CENP-A nucleosomes with and without its 
close binding partner, CENP-C, and combine these 
studies with functional tests that reveal the mecha- 
nisms underlying the high stability of centromeric 
chromatin. 

CENP-C recognizes CENP-A nucleosomes via a 
region termed its central domain (amino acids 
426 to 537; CENP-C) (2, 3). We first considered 
how CENP-C°? may affect the overall shape of 
the CENP-A-containing nucleosome using an in- 
tranucleosomal fluorescence resonance energy 
transfer (FRET)-based approach. We designed 
an experiment to measure FRET efficiency, Prger, 
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Fig. 1. CENP-A nucleosomes have a conventional shape only upon CENP-C°? binding. Calculated | is likely that CENP-A nucleosomes sample both 
FRET efficiencies (®rret) and distances between donor and acceptor fluorophores on H2B S123C for the conformations in solution, with crystal contacts 
indicated nucleosomes on either a-satellite or Widom 601 DNA. Data are shown as the mean + SEM of | stabilizing the form that was reported (4). In 


three independent nucleosome reconstitutions. 
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Fig. 2. CENP-C° rigidifies CENP-A nucleosomes. (A) HXMS of all histone 
subunits of the CENP-A nucleosome froma single time point (10* s; see all time 
points in fig. S3). Each horizontal bar represents an individual peptide, and 
peptides are placed beneath schematics of secondary structural elements. (B) 
Regions showing substantial protection from HX mapped onto the structure of 
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the CENP-A nucleosome (PDB ID 3AN2). (C and D) Comparison of representative 
peptides spanning the B-sheet region in histone H4 and histone H2A over the time 
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each peptide is shown by the dotted line. (E) The internal H4/H2A interface mapped 
(see fig. S5) onto the canonical nucleosome crystal structure (PDB ID 1KX5). 
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nucleosomes have smaller H2B distances in 
solution (Fig. 1) that are consistent with their 
crystal structure (5). Separation of H2A/H2B 
dimers from each other is consistent with a 
nucleosome model based on rotation of the 
CENP-A/CENP-A’ interface in (CENP-A/H4), hete- 
rotetramers (6). Upon binding of CENP-C™, with 
the known stoichiometry of two CENP-C° mol- 
ecules per nucleosome (3), the H2A/H2B dis- 
tances shorten to ones that are nearly identical 
to those in conventional nucleosomes (Fig. 1). 
The differences we observed between H3 nu- 
cleosomes, CENP-A nucleosomes, and CENP-A 
nucleosomes in a complex with CENP-C°? are 
found using either the human o-satellite DNA 
sequence that corresponds to the most heavily 
occupied site at centromeres (7) or the com- 


pletely synthetic “601” nucleosome positioning 
sequence (8) (Fig. 1). 

The shape change that we measure within 
the nucleosome upon CENP-C°? binding most 
likely occurs through rotation at the four-helix 
bundles between histone dimer pairs within 
the octameric core, with interhistone contacts 
being stabilized or destabilized depending on 
the preference for rotational state. We tested this 
prediction using hydrogen/deuterium exchange- 
mass spectrometry (HXMS). Strong protection 
of CENP-A nucleosomes (Fig. 2A and figs. S3D 
and S4) is conferred by CENP-C© binding on 
peptides spanning helices that are predicted (3) 
to contact it (i.e., the a3 helix and C-terminal 
residues of CENP-A, the a2 helices of both H4 
and H2A, and regions of H2A encompassing its 
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acidic patch residues). In addition to the sur- 
face changes induced by CENP-C“, there are 
internal changes to the nucleosome that we 
measure by HX (Fig. 2, A and B, and movie S1) 
that are consistent with the change in nucleo- 
some shape that we observed by FRET (Fig. 1). 
The separation of H2A/H2B dimers in CENP-A 
nucleosomes lacking CENP-C (Fig. 1) is pre- 
dicted to weaken an internal, intermolecular B 
sheet that serves as the physical connection be- 
tween the H2A subunit on one face of the nu- 
cleosome and the H4 subunit on the opposite 
face. When CENP-C° binds to the CENP-A nu- 
cleosome, peptides spanning the corresponding 
B-sheet residues of both H2A and H4 exhibit 
extra protection from HX by 1 to 2 deuterons, 
where the same level of HX takes 5 to 10 times 
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Fig. 3. Alterations in the nucleosome terminal DNA upon CENP-C° binding. (A) Major micrococcal nuclease (MNase)-digested DNA fragments observed 
for CENP-A nucleosomes assembled on its native centromere sequence. (B) MNase digestion profiles of CENP-A nucleosomes in the absence (red) and presence 
(black) of CENP-C“°. The black arrow (0.5 min) points to the 165-bp peak (DNA crossed at the dyad). The asterisk (4 min) denotes the final 100-bp peak. (C) 
Scheme of SANS contrast variation experiment together with paired distance distribution curves for CENP-A nucleosomes alone (red) and bound by CENP-C°? 
(black) in the indicated SANS contrast variation conditions. 
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Fig. 4. Depletion of CENP-C reduces the high stability of CENP-A at cen- 
tromeres. (A to C) Cells expressing SNAP-tagged CENP-A were pulse-labeled 
with TMR* then fused with cells expressing YFP-tagged CENP-A. Representa- 
tive images (A) show a cell in the second mitosis after fusion; insets show 3x 
magnification. X-means clustering was used to classify YFP only (triangles) or 
YFP and TMR* (“x” marks) centromeres (B), and mean (+ SEM) TMR* intensity 
was calculated for each group (C). (D and E) Cells expressing high levels of 
CENP-A-PAGFP were photoactivated in bulk (box) and centromeric (circle) 


- Dox 


702 8 MAY 2015 - VOL 348 ISSUE 6235 


B C 


eoscooeo°o°er 
OpANWROADNAWOO 


Average Normalized 
TMR%* Intensity 


0 0.5 1.0 a 
Normalized YFP Intensity WS 
yer 
E CENP-A CENP-A 
> (centromere) (bulk chromatin) 
‘o 1.2 1.2 
c 
2 
£ 1.0 1.0 
oO 
a 
5 08 0.8 
[s) 
wn 
© 0.6 0.6 
(e) 
2 
uw 0.4 0.4 
no] 
() 
® 0.2 0.2 
oO 
E 0 
3 “onsh °On8h 
G 
N 
> 
O 
a 
9 
ie) 
x 
o 
> 
[v) 
a 
N 
> 
[y) 
ra) 
+ 
iw) 
fe) 
x 
oO 
> 
Oo 
a 


chromatin. Representative images (D) show a subset of centromeres in a single z 
section. Fluorescence intensity was quantified at O and 8 hours after pho- 
toactivation [(E), mean + SEM]. (F) CENP-C knockdown begins causing cell 
death 4 days post-induction (mean + SD). (G and H) Cells with (+ Dox) and 
without (— Dox) CENP-C depletion were pulse-labeled with TMR* (day 2), and 
the relative CENP-A-SNAP signals were analyzed (day 3). Quantification shows 
CENP-A-SNAP signal retained at day 3 (>2500 centromeres plotted with 
mean + SD). Scale bars, 5 um. 
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as long to occur than in CENP-A nucleosomes 
lacking CENP-C“ (Fig. 2 and figs. $5 and S6). 

Because CENP-C might also affect the extent 
that DNA wraps the nucleosomes, we reconsti- 
tuted CENP-A nucleosomes using a 195-base pair 
(bp) DNA sequence from oa-satellite DNA (9) that 
contains a contiguous sequence spanning the 
major binding site it occupies on human centro- 
meres (7) (Fig. 3A). We first overdigested CENP-A 
nucleosomes and found very strong protection of 
100 bp (fig. S9). Using a subsequent restriction 
digest of the 100-bp digestion product, we found 
that they were uniquely positioned, with their 
dyad precisely where the same-sized fragment 
previously mapped with native centromeric par- 
ticles (7) (fig. S9). CENP-A-containing nucleosomes 
have many discrete intermediate digestion pro- 
ducts before the strongly protected 100-bp frag- 
ment is generated (Fig. 3, A and B, and fig. S10). 
When CENP-C° is bound, digestion products 
larger than a nucleosome core particle [e.g., >145 bp, 
where DNA strands could cross at ~165 bp for 
conventional nucleosomes (J0)] are missing at 
early time points (Fig. 3B). This suggests that 
when CENP-C° binds to the nucleosome, the 
DNA above the dyad rarely crosses, as it would 
normally cross for conventional nucleosomes. 
Second, digestion to the 100-bp final fragment 
proceeds more quickly (Fig. 3B). Thus, transient 
unwrapping of two helical turns (i.e., ~20 bp) 
from each terminus of the nucleosome is enhanced 
when CENP-C° is bound. 

To determine whether CENP-C binding leads 
to a steady-state structural change of nucleoso- 
mal DNA, we used small-angle neutron scatter- 
ing (SANS) with contrast variation. When CENP-C 
binds to CENP-A nucleosome core particles, the 
distance distribution profiles reflecting the shape 
in solution substantially redistribute for both the 
protein- and DNA-dominated measurements (Fig. 
3C, fig. S11, and table S3). The increase in larger 
interatomic vectors for the protein component is 
expected to accompany an additional component 
(CENP-C°”). The pronounced redistribution of 
vectors to both smaller and larger distances in 
DNA-dominated scattering when CENP-C° is 
bound is attributed to compaction of the nucleo- 
some core (smaller vectors) and opening of the 
nucleosome terminal DNA when CENP-C° is 
bound (larger vectors). 

We took two complementary approaches in 
cells to determine whether CENP-A is stably re- 
tained at the centromere upon which it is initially 
deposited (see the legend for fig. S12 that describes 
the motivation for these experiments). First, we 
used cell cycle-synchronized fluorescence pulse 
labeling of CENP-A in “donor” cells and subse- 
quent cell fusion with an “acceptor” cell line. The 
donor cells express SNAP-tagged CENP-A that 
has been pulse labeled with tetramethylrhodamine- 
Star (TMR*) to irreversibly label CENP-A (1D) be- 
fore cell fusion. The acceptor cells express yellow 
fluorescent protein (YFP)-tagged CENP-A that is 
loaded at all centromeres, continuing even after 
fusion. At time points through the subsequent 
cell cycle (fig. S12) until the second mitosis 
(Fig. 4A), we observed no detectable exchange of 
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the TMR*-labeled donor CENP-A to the acceptor 
centromeres in a shared nucleoplasm. Quanti- 
tation of the fluorescence at each centromere in 
these heterokaryons yields a bimodal distribu- 
tion. The donor centromere group with high 
TMR* and low YFP (Fig. 4B, “x” symbols) has an 
average TMR* signal of 0.538 + 0.005 (normal- 
ized arbitrary units where the maximal measured 
TMR* signal in each heterokaryon equals 1) (Fig. 
4C), whereas the acceptor centromere group 
with high YFP and low TMR* (Fig. 4B, triangle 
symbols) has an average TMR%* signal of 0.055 + 
0.005 (Fig. 4C). These data indicate that once 
assembled at a centromere, an individual CENP-A 
molecule is stably maintained at that particular 
centromere. 

As a complementary approach to test CENP-A 
stability at individual centromeres, we used a 
photoactivatable version of CENP-A (CENP-A- 
PAGFP). We induced expression of CENP-A to 
the extent that it is present at locations throughout 
the nucleus, but with clear enrichment at centro- 
meres, and then activated a defined region of each 
cell nucleus (Fig. 4D, 0 hours postphotoactivation). 
CENP-A-PAGFP signal is quantitatively retained 
at the activated centromeres and does not ac- 
cumulate at unactivated centromeres (Fig. 4, 
D and E), indicating that there is negligible 
exchange between centromeres, consistent with 
our cell fusion results. In contrast, CENP-A-PAGFP 
signal in bulk chromatin decays, with about 
half of the protein removed by 8 hours after 
photoactivation. 

To investigate whether CENP-C stabilizes CENP-A. 
at centromeres, we combined SNAP labeling of 
CENP-A with CENP-C depletion (Fig. 4), for which 
we generated a cell line with a chromosomally 
integrated, doxycycline-inducible CENP-C short 
hairpin RNA cassette. In our SNAP system, CENP-C 
depletion leads to a dramatic decrease in the 
retention over 24 hours of the existing pool of 
CENP-A at centromeres (Fig. 4, G and H). With- 
out CENP-C depletion, the average retention of 
CENP-A is slightly >100% (112% + 63% SD), an 
increase that is explained by having a small pool 
of prenucleosomal CENP-A in the cell population 
that is labeled by the TMR* pulse and subsequently 
incorporated into centromeres. Nascent CENP-A 
deposition is also decreased when CENP-C is de- 
pleted (fig. S16C)—consistent with its proposed 
role in the CENP-A assembly reaction (12, 13)— 
but this would only affect incorporation of the 
small prenucleosomal pool in the CENP-A reten- 
tion measurements (Fig. 4, G and H). Thus, our 
findings implicate CENP-C in stabilizing CENP-A 
nucleosomes at centromeres. We cannot rule out 
the possibility that removal of CENP-C in turn 
removes another centromere component that sta- 
bilizes CENP-A nucleosomes, but we favor the idea 
that CENP-C is the key molecule for stabilizing 
CENP-A nucleosomes based on its direct bind- 
ing to it. 

CENP-A nucleosomes are highly stable at the 
centromeres upon which they are initially assem- 
bled. This stability is possible through collaboration 
with CENP-C. Along with the intranucleosomal 
rigidity of CENP-A and histone H4, where the 


key interfacial amino acids are important for ac- 
cumulation at centromeres (6, 14, 15), the phys- 
ical changes imposed by CENP-C combine to 
make CENP-A nucleosomes at centromeres 
very long-lived (fig. S21). Our data support a 
model of a steady-state octameric histone core 
where H2A/H2B dimers can exchange from ei- 
ther terminus of the CENP-A nucleosome. At 
the center, there is an essentially immobile 
(CENP-A/H4), heterotetramer (J6) (Fig. 4 and 
figs. S12 to S14). Thus, the physical properties 
related to CENP-A nucleosome stability at cen- 
tromeres are tied to the intrinsic properties of 
the (CENP-A/H4), heterotetramer (6, 14, 15) 
and the extrinsic properties imposed by CENP-C 
(Figs. 1 to 3 and fig. S21). 
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STRUCTURAL VIROLOGY 


Near-atomic cryo-EM structure of the 
helical measles virus nucleocapsid 


Irina Gutsche,»?* Ambroise Desfosses,*|+ Grégory Effantin,»” Wai Li Ling,**>*® 
Melina Haupt,’ Rob W. H. Ruigrok,”? Carsten Sachse,? Guy Schoehn?”**>*® 


Measles is a highly contagious human disease. We used cryo-electron microscopy and single 
particle—based helical image analysis to determine the structure of the helical nucleocapsid 
formed by the folded domain of the measles virus nucleoprotein encapsidating an RNA 

at a resolution of 4.3 angstroms. The resulting pseudoatomic model of the measles virus 
nucleocapsid offers important insights into the mechanism of the helical polymerization of 
nucleocapsids of negative-strand RNA viruses, in particular via the exchange subdomains of the 
nucleoprotein. The structure reveals the mode of the nucleoprotein-RNA interaction and 
explains why each nucleoprotein of measles virus binds six nucleotides, whereas the respiratory 
syncytial virus nucleoprotein binds seven. It provides a rational basis for further analysis of 
measles virus replication and transcription, and reveals potential targets for drug design. 


onsegmented negative-strand RNA viruses 
(nsNSVs, or Mononegavirales) cause epi- 
demics of serious respiratory tract illnesses 
[e.g., measles virus (MeV) and respiratory 
syncytial virus (RSV)] and outbreaks of 
lethal zoonotic diseases [e.g., Ebola virus and 
Nipah virus (NiV)]. Although preventable by vac- 
cination, measles still remains one of the leading 
sources of death among young children worldwide 
(1). MeV belongs to the Paramyxoviridae family 
of Mononegavirales, which is further divided in 
two subfamilies: Paramyxovirinae (containing 
MeV and NiV) and Pnewmovirinae (e.g., RSV). 
The genome of negative-strand RNA viruses 
(NSVs) is enwrapped with the viral nucleoprotein 
N. The resulting ribonucleoprotein complex, called 
the nucleocapsid, protects the viral genetic infor- 
mation while providing a flexible helical template 
for viral transcription and replication by the viral 
RNA polymerase L, which, in the Paramyxoviridae 
and Rhabdoviridae families of Mononegavirales, 
is associated with the modular phosphoprotein 
cofactor P (2). As a unique structure in nucleic acid 
biology, the NSV nucleocapsid constitutes an at- 
tractive potential target for antiviral drugs with- 
out harmful side effects. Like all Mononegavirales 
nucleoproteins, MeV N is composed of two globular 
domains, the N- and C-terminal domains (NTD 
and CTD), which together form a stable peanut- 
shaped nucleoprotein core (Ngore, residues 1 to 391 
in MeV N) holding the RNA molecule in the 
interdomain cleft. In addition, like other Para- 
myxovirinae, MeV N features a long, intrinsi- 
cally disordered tail domain (N,ai), residues 392 
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to 525) (Fig. 1A) that confers to the nucleocapsids 
a structural plasticity hampering their analysis at 
better than ~2 nm resolution (3, 4). 

Removal of the N,,i) tightens the MeV nucleo- 
capsid helix, decreasing both its diameter and 
pitch (3-5). Our previous cryo-electron micros- 
copy (cryo-EM) map of the resulting Neore-RNA 
helix at 12 A resolution (5) provided precise heli- 
cal parameters and the domain organization 
of the MeV Nyore, and a number of low- and 
medium-resolution EM reconstructions of other 
Mononegavirales nucleocapsids are available 
(6-9). However, the only atomic-resolution struc- 
tural information about Mononegavirales nucleo- 
capsids comes from x-ray crystallography of 
N-RNA rings obtained upon nonspecific encap- 
sidation of short cellular RNAs by recombinant 
nucleoproteins of RSV (7) and of two rhabdoviruses 
(0, 11), whereas such rings could not yet be ob- 
served for Paramyxovirinae. In addition, the crys- 
tal structure of the RNA-free NiV Neore (N° core) 
bound to a short N-terminal region of P (N° core- 
P50) (12), has been solved recently. We therefore 
focused on the helical state of the MeV Noore-RNA 
nucleocapsid, so as to (i) obtain a high-resolution 
three-dimensional (3D) structure of the Noore 
monomer, (ii) identify the molecular determinants 
of helical nucleocapsid polymerization, and (iii) 
directly visualize the RNA inside the nucleopro- 
tein and understand why each MeV nucleopro- 
tein binds exactly six ribonucleotides, whereas 
the RSV N binds seven. Here, we report the 3D 
structure of recombinant MeV Neore-RNA nucleo- 
capsids at 4.3 A resolution, determined by single 
particle-based helical image analysis (13, 14) 
(fig. S1). This cryo-EM map reveals the detailed 
domain architecture of the Noore, its secondary 
structure elements and many bulky side chains, 
the rationale of the nucleoprotein packing into a 
helix, and the mode of nucleoprotein-RNA inter- 
action. Combined with the atomic structures of 
the RSV N and NiV N° ore, this 3D reconstruction 
allows us to build a reliable pseudoatomic model 
of the MeV Noore-RNA helix. 


The left-handed MeV Neore-RNA helix is com- 
posed of 12.34 nucleoprotein subunits per turn, 
with a pitch of 49.54 A and an outer diameter of 
190 A, in good agreement with previous studies 
(3-5) (table S1). The RNA thread winds around 
the nucleoprotein bobbin, accommodated inside 
the cleft between the outward-pointing NTD and 
the inward-oriented CTD, and shielded from above 
by the N subunits from the successive helical turn 
(Fig. 1B). As in the nsNSV N-RNA rings, MeV 
Noore oligomerization is mainly mediated via the 
exchange subdomains called the NTD arm (resi- 
dues 1 to 36) and the CTD arm (residues 373 to 
391). Our 3D reconstruction shows how the NTD 
arm of the N; protomer inserts into a groove in 
the CTD of the N;,, subunit, whereas the CTD 
arm lies on top of the Ncrp of the N;_; subunit, 
generating a repeated helical structure (Fig. 1, C 
to F). Unlike in the model of the RSV nucleocapsid 
(15), the N;_; and N;,; subunits in the MeV Noore- 
RNA helix do not interact directly. 

The domain organization of the MeV Noeore 
(Fig. 1D) corroborates the assumption that all 
nsNSV nucleoproteins share the same global fold 
(2, 16) (fig. S2A). In particular, the CTD fold appears 
conserved among the Paramyxoviridae (MeV, 
NiV, RSV) (7, 12). The major difference between 
the pneumovirus RSV and the Paramyxovirinae 
MeV and NivV is located, as predicted (7, 12), at 
the distal tip of the NTD (residues 91 to 159 in 
MeV N). The local resolution of the cryo-EM map 
in this solvent-exposed region, known as an anti- 
genic site of MeV N (residues 122 to 150) (17), 
seems to be the lowest, probably indicating mo- 
bility with respect to the nucleocapsid core (fig. 
SID). In NiV N° ,ore-Pso crystals, the RNA-free nu- 
cleoprotein is observed in an open state (72). The 
present structure of the MeV Nore-RNA mono- 
mer enables accurate modeling of the hinge mo- 
tion between NTD and CTD, which has been 
proposed to be associated with the open-closed 
transition accompanying nucleocapsid formation 
(fig. S2, B and C) (72). 

On the basis of the NiV N°.ore-Pso structure, 
the N-terminal domain of Paramyxovirinae P 
was assigned two simultaneous roles (72): (i) 
trapping the nucleoprotein in an open RNA- 
free conformation by rigidifying the CTD, and 
(ii) preventing its polymerization by interfering 
with the binding of the exchange subdomains. 
Indeed, although crystallization of NiV No vore-Ps0 
required deletion of both the NTD and CTD 
arms, the first and second oa helices of the helix- 
kink-helix peptide of P (Pal and Pa2) were shown 
to overlay with the NTD and CTD arm loops of 
RSV N, respectively (12). In MeV, the NTD arm 
begins with an o helix (residues 2 to 14) (Fig. ID 
and fig. S2A) that fits into a hydrophobic groove 
formed by three conserved CTD a helices of the N;,1 
subunit (Fig. 2A) (2). Specifically, four aromatic 
residues (Phe™, Phe”, Tyr*°’, and Phe®™“) that are 
conserved in Paramyxovirinae (12) seem to 
stack together, thereby fixing the NTD arm a 
helix and rigidifying the helix bundle around it 
(Fig. 2B). Furthermore, the NTD arm a helix of 
MeV N;_; perfectly superimposes with Pal from 
the NiV N®ore-Pso structure, although the latter 
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is actually positioned upside down, and the CTD helix of P and the NID arm o helix of MeV N The MeV Noore-RNA structure explains bio- 
arm loop of the MeV N;,; overlaps with Pa2 (Fig. | appear to play similar roles: They lock the CTD | chemical observations (18, 19) and demonstrates 
2C). Therefore, Pol may indeed compete with the | and the NTD-CTD junction into a stable confor- | how and why each MeV N binds six ribonucleo- 
NTD arm of the N;_; protomer, and Pa2 with the | mation, either open or closed depending on the | tides [three stacked bases facing the protein (i.e., 
CTD arm of the N;,; protomer (72). The first a |! presence of RNA in the interdomain cleft. “3-bases-in”) and three stacked bases pointing 


A 1 36 264 372 391 488 499 Fig. 1. Cryo-EM structure of 
the MeV Neore-RNA nucleo- 
capsid at near-atomic reso- 
lution. (A) Schematic of MeV 
N (navy blue, NTD arm; blue, 
NTD; salmon, CTD; yellow, 
CTD arm). The same color 
code (with RNA in green) is 
used for the rest of the 
figure. (B and C) lsosurface 
representation of the cryo- 
EM 3D reconstruction of the 
helical nucleocapsid: (B) 
front view, (C) cutaway view. 
(D) Ribbon representation of 
one Neore Monomer, with the 
corresponding segmented 
cryo-EM density shown in 
transparent gray. (E and F) 
Close-up of three consecu- 
tive protomers from the 
exterior (E) and from the 
interior (F) of the helix. Scale 
bars, 50 A [(B) and (C)], 

30 A [(D) to (F)]. 


A B C MeV Ny NIV Poo 5 
CTD arm oy, 


Fig. 2. Exchange domains and Nore oligomerization. (A) Ribbon representation of two consecutive protomers (colors as in Fig. 1). (B) Close-up of the stack of 
aromatics fixing the N; subunit NTD arm on the CTD of the Nj: Subunit and rigidifying the CTD helix bundle. Pink, Phe of the N; subunit; red, Phe?©°, Tyr2°3, and 
Phe? of the N;,; subunit; transparent gray, cryo-EM density. (C) The Ps peptide (red) of NiV superposes with the NTD arm of the MeV N,_, subunit (navy blue) 
and the CTD arm of the Nj; subunit (yellow) of MeV. The gray molecular surface represents the MeV N; subunit. The N termini of N and Pso and the first and second 
a helices of Pso are indicated. 
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toward the solvent (i.e., “3-bases-out”)], whereas 
RSV N binds seven ribonucleotides (“3-bases-in” 
and “4-bases-out”) (Fig. 3, A and B, and figs. S3 
and S4). In both viruses, the “3-bases-in” of the 
N; subunit are 2-3-4, while base 1 stacks onto 
the last base of the N;-; to form an outward- 
oriented stack of three (5-6-1) or four (5-6-7-1) 
bases which crosses the N;_;-N; interface, thereby 
probably contributing to the stability of the nu- 
cleocapsid helix. Switch 1 (between bases 1 and 2) 
constrains the RNA to turn inward, and switch 2 
(between bases 4 and 5) orients the RNA out- 
ward again (Fig. 3, C to F, and fig. S3). Thus, the 
RNA topology in MeV and RSV is similar (fig. 
S4). In MeV nucleocapsid, the outwards-facing 
groove is however shorter, in particular because 
long conserved residues R194, K198, Q201, Q202 
and Y260 (Fig. 3G) point into it from the 5’ end, 
sterically hindering packing of a fourth base 
(Fig. 3, D to F, and figs. S3 and S4). Importantly, 


A 


Bs, Switch 1 


Y260, strictly conserved in Paramyxovirinae and 
featuring a well defined side-chain cryo-EM den- 
sity, does not flip away from the RNA-binding 
cleft upon RNA encapsidation as proposed (12), 
but stacks with base 2, orienting bases 2-3-4 
“in” and therefore contributing to the switch 
1 (Fig. 3, E and F). The mechanism of the switch 
2 (assured by Lys!®°, Asp’®°, and Asn?! in MeV 
N-RNA) also differs between the two viruses 
(Fig. 3C and fig. S3). Finally, as in RSV, in MeV 
the N-RNA contacts seem to be rather RNA 
backbone-specific than base-specific. In partic- 
ular, three positively charged residues, Lys'®°, 
Arg’®**, and Arg®**, show a clear side-chain den- 
sity and make hydrogen bonds to the RNA back- 
bone (Fig. 3C), and Arg” (Fig. 3D) and Lys’® 
(Fig. 3E) may contribute as well. All these basic 
residues are conserved among Paramyxovirinae 
(Fig. 3G) and are present in equivalent positions 
in RSV N (22). 


Bases ‘out’ 


MeV 3' 


oO 


190 


Switch 1 


Both the nucleoprotein protomer and the en- 
tire helical nucleocapsid are dynamic entities that 
must rearrange during the viral replication cycle. 
Our findings show how the exchange subdomains 
of N invade the adjacent subunits in the helix, 
resulting in a stable and yet flexible nucleocapsid 
assembly. The NTD arm a helix of MeV N; inserts 
into the N;,; to ensure the major stabilizing 
interaction, whereas the CTD arm not only tethers 
to the N;-; but also permits the intrinsically un- 
folded Nai) to escape outside between two helical 
turns, leading to the increased flexibility of the 
native MeV N-RNA (4, 20). The N,ay appears to 
emerge out of the nucleocapsid core in close 
proximity to the RNA belt surrounding it, so as to 
dock the P-L complex into its functional envi- 
ronment on the viral genome (2/-23). Thus, 
our structure provides a framework for under- 
standing nucleocapsid architecture and remodel- 
ing during viral transcription and replication. It 
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Switch 2 


200 

MeV 180 KAVTAPDTAADSEL-RRWIKYTQORRV 205 
NiV 178 KAVTAPDTAEESET-RRWAKYVQOKRV 203 
RSV 170 K-LAAGDRSGLTAVIRR-ANNVLKNEM 194 


Fig. 3. Nucleoprotein-RNA interaction. (A) Ribbon representation of three consecutive protomers [navy 
blue, NTD arm; yellow, CTD arm; green, RNA (bases “in”); orange (bases “out”]. (B) Schematics of the RNA 


topology for MeV and RSV. For MeV, bases “in” are in green, bases “out” in orange and pink; RSV RNA in gray. 
Positions and boundaries of successive nucleoprotein subunits on the RNA are indicated; switches in RNA conformation are shown as blue dotted lines. (C to F) 
Close-up of protein-RNA interaction (scale bar, 10 A). Protein colors are as in Figs. 1 and 2; RNA colors are the same as in (A) and (B). Numbers of relevant residues 
and bases are indicated, as are switches in RNA backbone. (G) Sequence alignment of the RNA-binding motif of MeV, NiV, and RSV N. Amino acid abbreviations: A, 
Ala; D, Asp; E, Glu; G, Gly; |, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg: S, Ser; T, Thr; V, Val; W, Trp; Y, Tyr. 


706 8 MAY 2015 « VOL 348 ISSUE 6235 


sciencemag.org SCIENCE 


RESEARCH | REPORTS 


may also stimulate the design of new antiviral 
drugs, because it reveals key regions interfering 
with nucleoprotein oligomerization and/or ge- 
nome encapsidation. Finally, because MeV shares 
many common features with other Paramyx- 
oviridae and with nsNSVs in general, this near- 
atomic structure of the helical MeV Noore-RNA 
nucleocapsid may be valuable for the whole 
Mononegavirales field. 
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OPTOGENETICS 


Engineering of a light-gated 


potassium channel 


Cristian Cosentino,’* Laura Alberio,' Sabrina Gazzarrini,’ Marco Aquila,’ 
Edoardo Romano,’ Solei Cermenati,’ Paolo Zuccolini,’ Jan Petersen,” Monica Beltrame,' 
James L. Van Etten,? John M. Christie,” Gerhard Thiel,* Anna Moroni‘ 


The present palette of opsin-based optogenetic tools lacks a light-gated potassium (K*) 
channel desirable for silencing of excitable cells. Here, we describe the construction 

of a blue-light-induced K* channel 1 (BLINK1) engineered by fusing the plant LOV2-Ja 
photosensory module to the small viral K* channel Kev. BLINK1 exhibits biophysical 
features of Kcv, including K* selectivity and high single-channel conductance, but reversibly 
photoactivates in blue light. Opening of BLINK1 channels hyperpolarizes the cell to the 
K* equilibrium potential. Ectopic expression of BLINK1 reversibly inhibits the escape 
response in light-exposed zebrafish larvae. BLINK1 therefore provides a single-component 
optogenetic tool that can establish prolonged, physiological hyperpolarization of cells 


at low light intensities. 


otassium ion (K*) channels have a modu- 

lar structure with sensor domains connected 

to a central ion-conducting pore (1). The 

pore integrates signals coming from the 

sensors and translates them into opening 
or closing the channel (2). This allows K* chan- 
nels to alter the membrane potential of cells in 
response to a variety of physiological stimuli. 
Extending the range of signal inputs recog- 
nized by K* channels can be achieved by graft- 
ing exogenous sensor domains onto the pore 
module (3, 4). With this modular interplay be- 
tween sensor and pore, it is possible to engineer 
synthetic channels that respond to any signal by 
ex novo coupling of sensors to pores. This strat- 
egy provides new tools for the investigation and 
manipulation of biological functions (5). An at- 
tractive synthetic channel in this context is a light- 
gated K* channel, which is important because of 
the ability of K* to terminate excitatory currents 
within cells. This device would allow remote ma- 
nipulation of the membrane potential with high 
temporal and spatial resolution and would repre- 
sent an efficient control mechanism for many 
cellular processes, including neuronal firing and 
hormone release. 

Several attempts have been made to create 
synthetic light-gated K* channels (6-9); however, 
these systems suffer from several shortcomings 
in that they require the addition of cofactors (6, 7), 
are irreversible (8), or rely on multiple compo- 
nents (9). To overcome these obstacles, we engi- 
neered a single-component light-gated K* channel 
by fusing the LOV2-Ja photosensory region of a 
plant blue-light receptor (70) to the miniature K* 
channel pore Kcv (J). Rational design and di- 
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rected evolution were employed to ultimately 
generate a blue-light-inducible K* channel that 
functions reversibly to drive cell membrane po- 
tentials to K* equilibrium in the absence of exog- 
enous cofactors. The LOV2-Ja photoswitch from 
Avena sativa phototropin 1 (hereafter LOV) can 
be used to control protein activity by light-induced 
conformational changes (72). We therefore adopted 
this strategy to place Kcv under light control. LOV 
was fused to various regions of Kcv known to be 
mechanically important for channel gating (fig. 
SIA and constructs 3 to 12 in table S1). 

A functional complementation approach based 
on the growth rescue of Atrki Atrk2 potassium 
transport-deficient yeast (strain SGY1528) (73) was 
adapted to screen for light-gated channel activity 
after replica plating (fig. SIB). One Kev variant 
with LOV fused at the N terminus (LK) showed 
light-induced growth on selective agar (4 mM K*) 
and liquid culture (fig. S1, B and C). LK was ex- 
pressed in Xenopus oocytes and tested by a two- 
electrode voltage clamp. LK currents showed 
modest but reproducible increases in conductance 
after transfer from darkness to blue light (455 nm, 
80 uW/' mm”) (fig. S2A). However, photostimulation 
of LK currents required tens of minutes to develop 
and appeared to be irreversible. In an attempt to 
enhance coupling of LOV to Kcv, the soluble pho- 
tosensory region of LK was tethered to the plasma 
membrane. Introducing a putative myristoylation/ 
palmitoylation sequence (MGCTVSAE) (J4) at the 
N terminus of LK resulted in improved, but unex- 
pected, properties. The new variant myLK (Fig. 1A 
and fig. S2B) showed an enhanced response to 
light compared with LK, but, in this case, light was 
found to inhibit rather than activate the channel 
conductance (fig. S2B). Moreover, the effect of 
light was reversible and it did not decrease after 
repetitive exposures. The light sensitivity of the 
channel was wavelength-specific, elicited by blue 
but not by red light (fig. S2B). The dynamic range 
of the light effect (DR), ie., the ratio between 
light and dark current, was approximately 1.3. To 
improve the performance of myLK, three point 
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Fig. 1. Engineering and functional characterization of light-gated myLK 
channels. (A) Cartoon representation of myLK engineered by fusing LOV to 
Kev with an additional N terminal myristoylation/palmitoylation sequence; 
myLK is shown as a monomer in the membrane. Kev comprises slide helix 
(SH), pore-helix (PH), turret (T), and transmembrane domains (TM1 and TM2) 
(23). The LOV domain includes LOV2 and Ja (10). Jagged lines indicate lipid 
anchoring to the membrane. (B) Currents recorded at —-60 mV in 100 mM [K* out, 
from oocytes expressing myLK mutants: 1532A, N538A, and myLK33. Note 
that myLK 1532A and N538A conductances are inhibited by blue light (blue 
bars, 455 nm), whereas that of myLK33 is activated. (C) Repetitive photoactivation 
of myLK33 shows the reproducibility of the effect. (D) Red light (red bars, 617 nm) 
at two light intensities, 13 and 50 uW/mm*, does not activate myLK33 current, 
whereas blue light does. (E) Current response of myLK33 to increasing light 
intensities recorded at -80 mV showing threshold (>7 uW/mm?) and satu- 
rating (35 wW/mm*) values of light intensity. (F) Dose-response curve obtained 
from n= 4 oocytes. Line indicates data fitting by a Hill-type equation, yielding a 
dissociation constant k = 25 u.W/mm*? and a Hill coefficient n = 6. (G) Kinetics 
of myLK33 activation by different intensities of blue light and (H) subsequent 
inactivation in the dark. An oocyte expressing myLK33 was exposed to a re- 
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petitive light/dark regime with increasing light intensities ranging from low 
(21 uW/mm?) to medium (29 uW/mm_?) to saturating (40 wW/mm*) intensities. 
The current responses to the three light treatments are normalized to com- 
mon starting values | = O for the onset of channel activation and / = -1 for the 
start of current decay in the dark; numbers on current traces indicate the light 
intensities in .W/mm®. Fitting of data with single exponential equation (red lines) 
yields similar to, values for low (21 uW/mm?, t= 49's), medium (29 pW/mm?, 
1= 52s), and high (40 wW/mm?, t = 59 s) light intensity. Currents decay in the 
dark with the same velocity, irrespective of pretreatment with a high (t = 148 s), 
medium (t = 137 s), or low (t = 158 s) light. (I) ton (blue symbols) and tor (black 
symbols) from a large number of independent experiments plotted as a 
function of light intensity. (J) Photoadduct formation kinetics for myL (con- 
struct 13, table S1) expressed and purified from Escherichia coli. Light-induced 
absorption changes were recorded at 450 nm (AAbs,s0 nm) in response to 
blue-light irradiation (455 nm, 90 .W/mm?) and show exponential kinetics 
(Ton = 7.6 8). (K) Dark recovery of Abs4s0 nm after light excitation shows ex- 
ponential kinetics (tof = 38 s). (L) Representative light-induced absorption 
spectra of myL recorded at 2-s intervals (upper panel) and light-minus-dark 
difference spectra recorded every 10 s (lower panel). 


mutations (G528A, 1532A, and N538A) known to 
augment LOV-effector protein interactions (15) 
were introduced into the construct, singly or in 
combination (constructs 15 to 21, table S1). Sev- 
eral of these variants exhibited robust differential 
growth over a range of selective conditions (fig. S3). 
Notably, two mutants, myLKI532A and myL- 
KN538A, had increased DR values, 1.5 and 1.9, 
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respectively (Fig. 1B), confirming that myLK archi- 
tecture enables rational protein design. 

To screen for additional DR improvements, 
random mutagenesis was performed using 
myLKN538A to generate a library of myLK- 
encoding sequences. Potassium transport- 
deficient yeasts, transformed with this mutant 


agar plates and then replica-plated onto selec- 
tive medium before exposure to darkness or light. 
Yeast expressing the parental myLKN538A chan- 
nel did not grow on media with [K*],4: below 
4 mM. Thus, variants growing below this con- 
centration were selected for further characteriza- 
tion. Thirty-five variants were obtained showing 


library, were grown initially on nonselective 


strong differential growth in the light (either 
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Fig. 2. Properties of 
BLINK1 in HEK293T 
cells. (A) Whole-cell 
currents recorded in 
HEK293T cells 
expressing myLJSK 
(BLINK1), in dark, 

blue light (455 nm, 

40 wW/mm?) and after 
returning to the dark. 
Voltage steps from 
+60 to -120 mV, tails 
at -80 mV (selected 
traces are shown 
every 40 mV). A mea- 
surable current, above 
the background, was 
found in about 8 to 
10% of the GFP-positive cells. (B) Steady state //V relation from currents in (A): dark 
(black squares), after 5 min in blue light (blue circles) and return in dark (triangles). (C) 
Currents recorded from an HEK293T cell as in (A) but with 5 mM of BaCle added in light. 
(D) Mean current-voltage relation (n = 4) from experiments as in (B): dark (black 
squares), blue light (blue circles), blue light + 1 mM BaCls (inverted triangles). (E) Mean 
Vrey Values (n = 3) of light-activated BLINK1 current plotted as a function of [K*]out: 


E F 
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wo . 


100 


slope = 58 mV/log [K*]. Pipette solution contained 130 mM K*. (F) Repetitive activation/deactivation of BLINK1 current at -70 mV by blue light/dark 
transitions. (G) Single-channel fluctuations recorded at -70 mV from the same BLINK1-expressing cell in dark (left) and in blue light (right). Dashed line 
indicates zero current level; dotted lines indicate apparent single-channel opening levels. Calculated single-channel conductance is 70 pS. 


activation or inhibition), 13 of which survived 
a second round of selection (to eliminate false 
positives) and were sequenced. The recovered 
mutations mapped throughout the randomized 
portion of myLK, but a subset clustered around 
proline 13 of Kcv (fig. S4), a residue known to 
affect channel gating (16). Notably, one partic- 
ular variant, myLK33 (construct 33, fig. S4), 
harboring a mutation in P13 (P13L) and in the 
myristoylation/palmitoylation sequence (A7T), 
was activated rather than inhibited by light, 
with a relatively large DR value (DR = 3) (Fig. 1B). 
MyLK33 was therefore chosen for extensive func- 
tional characterization. 

The channel could be repeatedly activated by 
light when expressed in oocytes without under- 
going apparent inactivation (Fig. 1C). Activa- 
tion was blue-light specific because red light 
had no perceivable effect on channel activity 
(Fig. 1D). By exposing the oocyte to increasing 
light intensities (Fig. IE), we obtained a sigmoidal 
current response with a distinct activation thresh- 
old (Fig. 1F). The mean dose-response curve was 
best fitted with a Hill function yielding a value 
for half-activation, k = 25 uW/mm?, and a Hill 
coefficient n = 6. This operational light sensi- 
tivity of the cell is less by a factor of about 500 
than that of cells expressing light-sensitive pumps 
such as NpHR (7). 

To obtain information on the kinetics of chan- 
nel activation/deactivation in light/dark, cells were 
irradiated with different intensities of blue light 
and transferred back into darkness. Activation and 
deactivation kinetics could be fitted with a single 
exponential function (Fig. 1, G and H). The time 
constants did not change over a wide range of 
light intensities, from suboptimal to saturating 
(Fig. 11). Activation (ton, 87 + 28 s, n = 52) was 
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Fig. 3. Light controls the behavior of zebrafish expressing BLINK1. (A) Altered escape response in 
2-day-old zebrafish, expressing BLINK1 or GFP. The embryos were injected at to, kept 24 hours in the dark, 
and then either exposed to blue light (80 wW/mm*) (blue) or kept in the dark (black). The escape response 
was tested by gentle mechanical stimulation with a pipette tip (see movie S1). Number of embryos (n) is 
indicated. (B) Reversibility of the effect of blue light on the escape response. Each data point represents the 
response of a batch of 2-day-old BLINK1-injected embryos, preselected for positive light response (n = 15). 
The embryos were repetitively exposed to blue light (blue circles) and dark (black circles) treatments 
(30 to 45 min each). (C) Kinetics of the light effect on the escape response of 2-day-old embryos. At time 
zero, blue light was turned on and the response to mechanical stimulation was checked every 15 min. After 
60 min, the effect reached the maximum and the light was turned off to monitor the deactivation kinetics. 
Data are from three experiments in which the n of responding embryos was 62 over a total number of 163 
(38%). Data were normalized to the maximum number of responding embryos at plateau in each 
experiment (after 60 min of light). 


about twice as fast as deactivation (tof 168 + 31 s, 
n = 43). To further examine the relationship be- 
tween channel gating and the photocycle of the 
sensor, we performed spectroscopic measurements 
on the sensor portion of the protein (myL), 
which was used to construct the channel. Global 
photoadduct formation at saturating light inten- 
sity (90 u.W/mm7?) and its recovery in the dark 
(Fig. 1, J to L) show exponential kinetics with 
values of 7.6 s for to, and 38 s for Tor. Hence, myL 
activates, under the same light conditions, about 
10 times as fast as the channel, whereas photo- 


adduct decay is about 4.5 times as fast as chan- 
nel deactivation in the dark. Thus, the primary 
light sensing by the photosensor seems to be 
only loosely correlated with channel gating. This 
agrees with the observation that other constructs 
harboring the same photoswitch as myLK33 (see, 
for example, 1532A and N538A) responded to 
light with a different kinetics (Fig. 1B). Together, 
these findings suggest that light-regulated chan- 
nel activity is mostly dominated by slow confor- 
mational changes that follow LOV2-Ja photo 
activation. 
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Experimental evidence that myLK33 retains 
the pore properties of the parental K*-selective 
channel Kcv is summarized in fig. $5. Barium, a 
known Kev channel blocker, completely inhib- 
ited both dark- and light-induced currents of 
myLK33 (fig. S5A). The current-voltage (-V) curve 
(fig. S5B) shows that light activation of myLK33 
is voltage-independent. The reversal potentials 
(Vey) of myLK33 photocurrent shifted according 
to the Nernst equation by 51 mV for a 10-fold in- 
crease in external K* concentration, a value similar 
to that of Kev (11) (fig. S5C). Moreover, exchang- 
ing 100 mM external K* with Na* shifted V,., 
to the left by 110 mV, indicating a very low Na* 
permeability (Py,/Px = 0.015 + 0.001 (n = 4) 
(fig. S5D). 

Despite these desirable features, in vivo appli- 
cation of myLK33 is hampered by its high dark 
activity. We have previously shown in other syn- 
thetic channels that the linker between the sen- 
sor and the effector module influences the coupling 
of the two (4). We therefore progressively reduced 
the linker region within myLK in an attempt to 
improve the control of the photosensor over the 
channel pore (constructs 23 to 37 in table S1 and 
fig. S6). Best results were obtained with myLJSK, 
which lacks the final nine amino acid residues 
within the Jo: helix (fig. S7A). This variant showed 
stringent differential growth in the yeast comple- 
mentation assay (fig. S7B) but was poorly ex- 
pressed in oocytes. Immunolabeling showed that 
this channel is present on the membrane of hu- 
man embryonic kidney 293T (HEK293T) cells, 
albeit at a moderate level, when compared with 
the parental channel Kcv (fig. S7C). Subsequent 
electrophysiological characterization in HEK293T 
cells demonstrated that myLJSK is activated by 
blue light (Fig. 2, A and B). Light activation is 
fully reversible in the dark (Fig. 2, A and B) and 
inhibited by Ba?* (Fig. 2, C and D). Importantly, 
myLJSK lacks any channel activity in darkness, 
as evident from the comparison of the amount of 
current in the dark and in the presence of Ba?* 
(fig. S7D). Channel opening moves the reversal 
potential of the cell with Ex, the K* equilibrium 
voltage (Fig. 2E). The light sensitivity of myLJSK 
occurs in the same range as that of myLK33, 
being saturated at ~60 uW/mm? (fig. S7E). 

Light gating of myLJSK is best appreciated in 
gap-free recordings. Repetitive exposure to blue 
light caused rapid activation followed by deacti- 
vation in darkness (Fig. 2F). Data recorded from 
single-channel fluctuations (Fig. 2G) showed a 
light-induced increase in channel activity and 
indicated a unitary conductance of about 70 pS, 
consistent with the high conductance of Kev and 
its synthetic variant KVgyntn1 (4). The high single- 
channel conductance and the relatively low mac- 
roscopic currents (<400 pA at -100 mV) are 
consistent with the low number of channels, 
which were detectable by immunolocalization 
(fig. S7C). A low number of channel proteins with 
a large unitary conductance offers the advantage 
of an efficient control over the membrane voltage 
with minimal disturbance of the cell. These prop- 
erties of myLJSK ultimately fulfilled our criteria 
for successful light gating of Kcv. Hence, we re- 
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named this variant BLINK1, blue-light-induced 
K* channel 1. Expression of BLINK1 in Sf9 insect 
cells (fig. S8A) produced fluorescence properties 
that are characteristic for LOV-containing pro- 
teins (fig. S8, B and C). Photoadduct formation 
and decay for BLINK1 was therefore analyzed by 
fluorescence spectroscopy and showed kinetics 
similar to those obtained for myLOV in vitro 
(14.5 s for to, and 51s for tos) (fig. S8, D and E). 

To determine the in vivo applicability of BLINK1 
for optogenetics, we examined its ability to regu- 
late the escape response of zebrafish embryos. 
Two-day old embryos respond to touch with a burst 
of swimming (/8). We reasoned that BLINK1 
photoactivation in zebrafish neurons (either soma- 
tosensory or motor) and/or in myocytes would 
prevent or drastically impair this behavior. Em- 
bryos injected with either BLINK1 or green fluo- 
rescent protein (GFP) RNA showed robust escape 
motions when kept in darkness [over 90% of 
embryos in both cases (BLINK1, n = 199; GFP, 7 = 
151)]. By contrast, when embryos were exposed 
to blue light, 37% of BLINK1-expressing embryos 
exhibited a reduced escape response (7 = 230) 
compared with just 9% in control larvae (7 = 170) 
(Fig. 3A and movie S1). The blue-light effect was 
fully and repetitively reverted in darkness (Fig. 
3B), as expected for a BLINK1-driven effect. The 
light-driven effect on embryonic escape motion 
developed with a half-time of 15 to 20 min and 
was reverted by dark with a similar kinetics 
(Fig. 3C). 

Several additional observations underscored 
that BLINK1 was expressed in zebrafish and that 
altered behavior of the larvae was driven directly 
by its activation in blue light. First, the presence 
of the channel is detectable by Western blotting 
in BLINK1-injected embryos (fig. S9A). Second, 
the success of eliciting escape behavior was high- 
ly dependent on the wavelength of light with red 
light (617 nm) being ineffective at evoking altered 
behavior (fig. S9B). Third, in the dark, viability 
and morphology were similar in embryos ex- 
pressing BLINK1 or GFP (wild-type embryos at 
2dpf BLINK1 = 77%, 7 = 81; GFP = 69%, n = 59), 
confirming that BLINK] is tightly closed in the 
absence of light. Taken together, these data dem- 
onstrate the ability of BLINK1 to modulate be- 
havioral responses in vivo. 

In conclusion, we have created a light-activated 
K* selective channel by combining a blue-light 
sensor and a simple K* channel pore. The result- 
ing BLINK channel is fully genetically encoded 
and does not depend on external factors for its 
light regulation, as the flavin mononucleotide 
chromophore is ubiquitously present in cells. 
BLINK! is therefore a promising tool in opto- 
genetics and has several desirable properties rel- 
ative to other light-gated pumps (17, 19, 20) and 
channels (2/7, 22) used for inhibiting the func- 
tions of excitable cells. First, in contrast to light- 
gated pumps, which move H’* or CI ions, it moves 
K", a physiological ion, down its electrochemical 
equilibrium. Thus, BLINK1 does not expose the 
cell to unphysiological hyperpolarizations or ion 
gradients. Second, its large unitary conductance 
guarantees that a small number of channels can 


efficiently decrease the input resistance of cells 
and hyperpolarize the membrane. In addition, 
the channel does not inactivate in light, which 
allows long-term control of channel activity by 
light. The apparent sensitivity of BLINK1 to light, 
combined with its slow kinetics, makes it a pow- 
erful tool for long-term inhibition of cells at low 
light intensities. Our pilot experiments in zebra- 
fish demonstrate that BLINK1 can be successfully 
used as an in vivo optogenetic tool. Besides an 
obvious application for inhibiting neuronal activ- 
ity, this channel will also find applications in the 
control of cellular processes, which require long- 
term stabilization of the membrane voltage such 
as cell cycle regulation or the control of hormone 
secretion. 
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A forward genetic screen identifies 
erythrocyte CD55 as essential for 
Plasmodium falciparum invasion 
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Efforts to identify host determinants for malaria have been hindered by the absence of a 
nucleus in erythrocytes, which precludes genetic manipulation in the cell in which the 
parasite replicates. We used cultured red blood cells derived from hematopoietic stem 
cells to carry out a forward genetic screen for Plasmodium falciparum host determinants. 
We found that CD55 is an essential host factor for P. falciparum invasion. CD55-null 
erythrocytes were refractory to invasion by all isolates of P. falciparum because parasites 
failed to attach properly to the erythrocyte surface. Thus, CD55 is an attractive target 
for the development of malaria therapeutics. Hematopoietic stem cell-based forward 
genetic screens may be valuable for the identification of additional host determinants of 


malaria pathogenesis. 


evere malaria is caused by Plasmodium 
falciparum and is one of the leading causes 
of mortality among children globally (1). 
During infection, parasites invade and rep- 
licate within human erythrocytes (2). Host 
erythrocyte polymorphisms that confer resistance 
to severe malaria have been identified with epi- 


demiologic approaches (3, 4). Genome-wide asso- 
ciation studies have searched for host determinants 
of malaria, but functional validation in the eryth- 
rocyte remains challenging owing to the absence 
of a nucleus (5-7). Recent advances in the ex vivo 
production of erythrocytes now enable generation 
of genetically altered cells that support P. falciparum 
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infection (8-17). Here, we used ex vivo-cultured 
red blood cells (CRBCs) in a forward genetic screen 
in order to identify host determinants of malaria 
infection. 

The human erythrocyte is a terminally differ- 
entiated minimal cell that lacks organelles and a 
nucleus and has a small proteome (12). To identify 
erythrocyte proteins that influence P. falciparum 
infection, we designed a screening strategy involv- 
ing RNA interference (RNAi)-based knockdown 
of gene expression in hematopoietic progeni- 
tor cells, induction of ex vivo erythropoiesis, and 
last, infection of terminally differentiated erythro- 
blasts with P. falciparum. Because gene knock- 
downs that affect erythroid development could 
have potent yet nonspecific effects on parasites 
in this approach, we first screened the erythrocyte 
proteome so as to identify genes that influence 
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Fig. 1. Pooled shRNA screen for genes that reg- 
ulate terminal erythroid differentiation. (A) Sche- 
matic of pooled shRNA screen. CD34* HSCs were 
induced toward erythroid development, transduced 
with shRNA library on Day 6, and selected with 
puromycin. On day 19, shRNA proviruses in ortho- 
chromatic erythroblasts were quantified by means 
of Illumina (San Diego, CA) sequencing. (B) Change 
in relative abundance of 5530 shRNAs in erythro- 
cyte proteome library after 19 days of differentia- 
tion. (©) RNAi gene enrichment ranking (RIGER) 
analysis of 116 candidate genes based on magni- 
tude of erythropoiesis phenotype. NES, normalized 
enrichment score. (D) Predicted localization of libra- 
ry components, with percent of genes that influence 
erythropoiesis in red. 
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terminal differentiation of erythrocytes (Fig. 1A, 
fig. SLA, and data set S1). CD34* hematopoietic 
stem cells (HSCs) were transduced with our 
pooled erythrocyte proteome short hairpin RNA 
(shRNA) library and induced to proliferate and 


differentiate to the orthochromatic erythroblast 
stage (when parasite infection can occur) (13). 
After 19 days, we quantified each shRNA in the 
surviving orthochromatic erythroblasts relative 
to the original library, with the prediction that 


shRNAs underrepresented in the differentiated 
erythroblasts would target genes important for 
erythropoiesis. 

The erythropoiesis screen yielded sufficient 
cells for full coverage of the library (>1500 cells 


per shRNA) (fig. S1B) (74), and deep sequencing 
of shRNA proviruses isolated from the terminally 
differentiated erythroid cells revealed a normal 
distribution (fig. SIC). Of shRNAs, 4.4% were de- 
pleted more than 50% in day-19 cRBCs relative to 
the original library pool, whereas 2% were en- 
riched (Fig. 1B), indicating that these shRNAs can 
influence erythropoiesis. We ranked genes ac- 
cording to the depletion or enrichment of mul- 
tiple shRNAs (15) and identified 116 candidates 
that grouped into categories relevant to terminal 
erythroid differentiation, including heme metab- 
olism, protein turnover, and apoptosis (Fig. 1C, 
figs. SID and S2, and data sets S2 and S3). We 
validated four top hits (fig. S3 and data set S4). 
This functional analysis of erythropoiesis provides 
a framework with which to study host determi- 
nants of malaria infection. 

To identify factors that influence host suscep- 
tibility to P. falciparum infection, we chose to fo- 
cus on a small subset of the erythrocyte proteome: 
42 genes encoding human blood groups. All 
known P. falciparum receptors fall within this 
group, and the shRNAs targeting these genes 
did not appear to affect erythroid development 
(Fig. 1D). Also, focusing on a small gene set in- 
creased the sensitivity to a level required for the 
inherently complex parasite screen (16). 

We transduced hematopoietic progenitor cells 
rah rah (HPCs) with a pooled lentivirus shRNA library 

targeting the blood group genes (Fig. 2A). At the 

ST8SIA6 : z 

rat rant XG late orthochromatic erythroblast stage, we in- 
XK fected the knockdown cells with a green fluores- 
cent protein (GFP)-expressing line of P. falciparum 
strain 3D7. We isolated the parasitized cells and 
quantified the relative abundance of each shRNA 
in the population by means of deep sequencing 
(fig. S4). In parallel, we quantified the abundance 
of each shRNA in a control population of knock- 
down cRBCs not exposed to parasites. Hairpins 
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Fig. 2. Pooled shRNA screen to identify host determinants of P. falciparum infection. (A) Schematic 
of blood group shRNA screen. HPCs were transduced with pooled lentivirus library expressing 308 
shRNAs targeting 42 blood group genes. Knockdown cRBCs infected with P. falciparum were sorted, and 
shRNAs were quantified by means of deep sequencing. (B) RIGER analysis ranking results for three 
independent experiments. Genes were ranked according to NES scores (green heat map). Arrows indicate 
top hits; asterisks indicate additional candidates. 
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Fig. 3. Validation of CD44 and CD55 as host factors required for P. falciparum invasion. (A) CD44 and CD55 levels on day 19/20 cRBCs expressing CD44, 
CD55, or control shRNAs (EmpT). Detection was performed by means of antibody staining and flow cytometry. (B) Morphology of differentiating cRBCs depleted 
for CD44 and CD55, detected by means of May-Grtinwald and Giemsa staining. (C) P. falciparum strain 3D7 invasion assays in control, CD44 knockdown, and 
CD55 knockdown cRBCs. Three independent biological replicates from two distinct bone marrow donors are shown. Mean + SD, n = 2 or 3 assays. *P < 0.05, one- 
tailed t test. 
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detection. (C) Invasion by laboratory- 


adapted P. falciparum strains. (D) Invasion by P. falciparum clinical isolates. Pound sign indicates thin smears showed 0.6 to 1% gametocytes. For (B) to (D), 
mean + SD, n = 3 assays. 10,000 cells scored per well by means of flow cytometry. (E) Efficiency of P. falciparum 3D7 merozoite attachment to the surface of 
CD55-null (green) versus control (blue) RBCs by using cytochalasin D (Cyt D). Invasion was measured in the absence of Cyt D.T1, 30 min; T2, 60 min; and T3, 
180 min after addition of schizonts (fig. S10). Attachment to controls at T1 was 3.6 to 5.3%. Mean + SD, n = 2 biological replicates. 20,000 cells scored per well 


by means of flow cytometry. 


underrepresented in infected cells compared with 
control cells were considered hits because they 
may target host genes required for efficient para- 
site infection. 

We ranked candidates from three replicates 
(Fig. 2B and data set S5) (15). Among the highly 
ranked genes were basigin (BSG) and CR1, both 
of which have described roles in P. falciparum 
invasion (17-19). The top-ranked candidate was 
CD55, also known as decay-accelerating factor 
(DAF), which carries the Cromer blood group an- 
tigens. CD55 is a glycosylphosphatidylinositol- 
linked complement-regulatory protein that protects 
cells from lysis by complement (20). On epithelial 
cells, CD55 is a receptor for bacterial and viral 
pathogens (27-23). Another hit, CD44, defines the 
Indian blood group and facilitates keratinocyte 
invasion by group A Streptococcus (24, 25). 

For validation, we expressed individual shRNAs 
targeting CD44 and CD55 in HPCs so as to gen- 
erate mature red blood cells deficient in CD44 or 
CD55 (Fig. 3A). Morphological development of 
the knockdown cells was indistinguishable from 
control cRBCs (Fig. 3B), as were levels of Band 3, 
CD49d, and CD71 (fig. S5) (26). To determine 
whether the amount of CD44 or CD55 on red 
blood cells influences P. falciparum infection, we 
assessed parasite invasion into the knockdown 
cRBCs compared with control cRBCs. We observed 
~30% reduction in parasitemia in cells deficient 
for either CD44 or CD55 relative to control cRBCs 
(Fig. 3C). The same trend was observed by using 
alternative shRNAs for CD44 and CD55, suggest- 
ing that the results were not because of off-target 
effects of the shRNAs (fig. S6). Levels of known 
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Table 1. Geographic distribution of CD55* coding variants as percent of population, from the 
1000 Genomes Project, phase I. CD55 isoform ENSTO00000367064 (DAF-2). 


AAA* R52L L82R Y133H A227P_ 1231V V3331 G354A G372E 
IxBTt CROM3 CROM8 CROMI1 

Pops(1092)* Exp’ 

Aggr(688) low O O 0) 00 O O O (0) 
TSI(98) low O O (0) 00 1.02 O O (0) 
ASW(61) high 4.92 0) 1.64 01.64 0 O O (0) 
CLM(60) high O O (0) 03.33 0) O 0) 1.67 
LWK(97) high 0) 0 (0) 14.43 0 (0) 1.03 (0) 
YRI(88) high 5.68 1.14 (0) 01.14 0) 1.14 0) (0) 

P value! 3.56x10°° 028 028 785x10™" 1 £028 028 0.28 


*Amino acid change and location; reference single-nucleotide polymorphism numbers are given in (13). tinter- 
national Society of Blood Transfusion type. tPopulations: Aggr, aggregated populations with no variant in 
sample (CHS, Southern Han Chinese, China; MXL, Mexican Ancestry in Los Angeles, CA; PUR, Puerto Rican in 
Puerto Rico; CEU, Utah residents with North and West European ancestry; CHB, Han Chinese in Bejing; FIN, Finnish 
in Finland; GBR, British in UK; IBS, Iberian populations in Spain; JPT, Japanese in Tokyo); TSI, Toscani in Italy; ASW, 
African Ancestry in Southwest US; CLM, Colombian in Medellin, Colombia; LWK, Luhya in Webuye, Kenya; and YRI, 
Yoruba in Ibadan, Nigeria. The numbers of individuals sampled are shown in parentheses. §Exposure of current 
or ancestral population to malaria based on World Health Organization DALY (disability-adjusted life year per 


populations with high and low exposure. 


P. falciparum receptors were unchanged in CD44- 
and CD55-deficient cRBCs, which suggests that 
the observed effects on invasion were directly at- 
tributable to CD44 and CD55 (fig. $7). 

Complete absence of Cromer blood group anti- 
gens is rare and has been described in nine indi- 
viduals with the Inab phenotype, most of whom 
have stop codons in the CD55 coding sequence 
(20). Cells from Inab individuals have normal os- 
motic fragility, do not undergo hemolysis in vivo, 


100,000 population) malaria data: high, DALY = 10; low, DALY < 10. 


Fisher's exact test comparing 


and are not particularly susceptible to complement- 
mediated lysis (27, 28). We used mature eryth- 
rocytes from a Japanese patient with the Inab 
phenotype (termed CD55-null cells) (29) to test 
whether CD55 has an essential role for P. falciparum 
invasion. To first characterize any inherent dif- 
ferences between the CD55-null and wild-type (WT) 
control samples, we adapted plasma membrane 
profiling (PMP) using tandem mass tag-based mass 
spectrometry for use in erythrocytes (30). CD55 was 
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absent from the Inab erythrocytes (Fig. 4A). Only 
two other notable differences were found between 
the CD55-null and WT cells: a 75% reduction in 
CRI, which is well within the range of natural 
variation (19), and a similar reduction in C4BPA, 
a serum protein that binds to complement regu- 
latory proteins. Only a weak association has been 
observed between parasite invasion efficiency and 
natural variation in CR1 levels, and only when all 
sialylated proteins have been removed from the 
cell surface (8, 19). 

To determine the contribution of CD55 to P. 
falciparum infection in mature erythrocytes, we 
used the CD55-null cells in invasion assays. The 
CD55-null erythrocytes were refractory to inva- 
sion by P. falciparum strain 3D7, which suggests 
that host CD55 is essential for parasite infection 
(Fig. 4B). This finding suggests that the reduced 
but substantial parasitemia observed in the cells 
in which CD55 was knocked down depended on 
residual CD55. The CD55-null cells were similarly 
nonsusceptible to invasion by additional laboratory- 
adapted P. falciparum strains, including the CR1- 
independent W2mef line (Fig. 4C). Parasite invasion 
was also significantly impaired in erythrocytes from 
a second, unrelated Inab patient with a different 
genetic basis for the condition (fig. S8A) (37). Pro- 
liferation of parasites was also diminished in these 
Inab cells (fig. S8B), which suggests that CD55 
would be required to support a productive mala- 
ria infection. 

Besides laboratory-adapted strains, several clin- 
ical isolates of P. falciparum from Senegalese pa- 
tients were also dependent on CD55 for invasion 
(Fig. 4D), providing additional evidence that CD55 
is likely a critical host factor for all P. falciparum 
strains. In contrast to P. falciparum, the zoonotic 
human malaria parasite P. knowlesi invaded CD55- 
null and WT erythrocytes with similar efficiency 
(Fig. 4B). Thus, CD55 on human erythrocytes 
may interact with a parasite ligand specific to 
P. falciparum that is not expressed by other ma- 
laria parasites. Deletion of known invasion ligands 
(EBA140, EBA175, EBA181, RH1, RH2a, or RH2b) 
did not enable parasite invasion in the absence of 
CD55 (fig. S9). 

To determine whether CD55 is involved in at- 
tachment of P. falciparum to the erythrocyte or 
at a later stage of the invasion process, we used 
cytochalasin D to enable isolation of cells with 
parasites adhered to the outer surface (32). Initial- 
ly, attachment of P. falciparum merozoites was 
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similar for the CD55-null and WT cells, but over 
time, parasites selectively detached from the cells 
lacking CD55, mirroring the invasion defect (Fig. 
4E). Thus, CD55 may not be required for the pri- 
mary interaction of merozoites with the RBC 
surface, but instead be critical for the stage of 
committed, irreversible attachment seen during 
formation of the tight junction (33). 

Our results reveal an essential role for CD55 in 
P. falciparum invasion of human red blood cells. 
Levels of CD55 vary dynamically during clinical 
malaria and may influence the course of infection 
(34). Moreover, we identified two CD55 polymor- 
phisms significantly enriched in persons with 
ancestral exposure to malaria (Table 1) (35), both 
of which have been previously described almost 
exclusively in individuals of African descent (20). 

The existence of hematologically normal indi- 
viduals completely lacking CD55 suggests that 
targeting CD55 on erythrocytes would not elicit sub- 
stantial toxicity. In light of its critical role in parasite 
infection, CD55 could serve as an attractive target 
for the development of malaria therapeutics. 
Aside from CD55, the only other known strain- 
transcendent receptor for P. falciparum, basigin, 
binds to RH5, which is a leading malaria vaccine 
candidate (17, 36, 37). Our studies also establish 
the feasibility of forward genetic cellular screen- 
ing by use of cultured erythrocytes derived from 
HSCs to identify critical host determinants of 
P. falciparum malaria biology and pathogenesis. 
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Genotyping Arrays 

The newest arrays, Axiom Porcine 
Genotyping Array and Axiom Equine 
Genotyping Array, each with more than 
600,000 markers, will be available to 
the agriculture community under the 
Axiom Expert Design Program. These 
two high-density arrays add to the 
product portfolio that includes bovine, 
chicken, maize, salmon, and wheat. 
Affymetrix has recently launched arrays 
with the newest content for genomic 
breeding in soybean and cotton, and 
arrays for buffalo, rainbow trout, rose, 
and strawberry. Also new are enhanced 
wheat arrays that have come out of the 
U.K.-based Wheat Institute Strategic 
Programme (WISP). With Affymetrix’s 
advanced bioinformatics and innovative 
design strategies, Axiom arrays routinely 
support genome-wide genotyping and 
accurately call the genotypes of both 
diploid and polyploid species. 
Affymetrix 

For info: 888-362-2447 
www.affymetrix.com/agrigenomics 


Microscopy Illumination 

The new Lumen 1600-LED from Prior 
Scientific is the latest product in 

Prior’s ever-expanding microscopy 
illumination product line. The Lumen 
1600-LED offers individual control for 16 
selectable wavelengths. With its unique 
16 selectable LED wavelengths, the 
Lumen 1600-LED is compatible with all 
simple and multiband filter sets. For the 
user who wants ease of use, the Lumen 
1600-LED has a “white light” mode with 
simple on/off and intensity buttons. 
Fitting directly to most microscopes 
the system enables for rapid switching 
between LED wavelengths to allow 

for capture of high speed events. The 
Lumen 1600-LED is controlled via a 
manual keypad controller, TTL, USB, 
analogue input for dynamic intensity 
control. Excitation filters can be fitted 


in the optical path for controlled switching with no moving parts. 
Providing 25,000+ hours of bulb life, the Lumen 1600-LED is 
an environmentally friendly illumination system that requires no 


special disposal regulations. 
Prior Scientific 

For info: 800-877-2234 
www.prior.com 


LIFE SCIENCE TECHNOLOGIES 


Trace Gas Analyzer 

The new PTR-QiTOF and the PTR-TOF 
1000 are now available, complementing 
the PTR-TOF trace gas analyzer series. 
Both instruments can be optionally 
equipped with the new fast GC add-on, 
combining Gas Chromatographic separa- 
tion with real-time trace VOC analysis. 
Featuring a Quadrupole ion guide (Qi), the 
new PTR-QiTOF achieves a performance, 
no other commercial PTR-TOFMS system 
has ever been able to demonstrate before. 
Scientists can expect up to 25x more 
sensitivity, one order of magnitude lower 
detection limit and 20% higher mass reso- 
lution. This boost in sensitivity is especial- 
ly beneficial for cutting-edge applications 
like eddy-covariance flux measurements, 
where ultralow VOC concentrations have 
to be quantified with more than 10 Hz. The 
high time-resolution and sensitivity of |O- 
NICON PTR-TOF instruments enable flux 
measurements of biogenic VOCs, provid- 
ing new insights to our understanding of 
the atmospheric VOC budget by studying 
active atmosphere-ecosystem exchange 
of a vast number of BVOCs. 

lonicon 

For info: +43-512-214-800 
www.ionicon.com/ptr-qitof 


NEW PRODUCTS 


Cathode Gauge 

The Granville-Phillips Series 500 Cold 
Cathode Gauge sets a new accuracy 
standard (+10%) for wide pressure range 
CC gauges. The Series 500 features 

a longer lifetime than traditional CC 
gauges and fast, reliable starting. Its 
patent-pending method for determin- 
ing pressure dosage over time provides 
for predictive maintenance, and USB 
connectivity assures ease of setup and 
pressure monitoring. The Series 500’s 
unprecedented accuracy is based on 
innovative technology and time-tested 
calibration techniques. It offers a wide 
measurement range from 10°'° to 10? 
Torr, and includes electronic, digital, and 
analog interfaces. The USB interface and 
GP Connect software provides setup and 
diagnostics, including screens for set- 
ting up the gauge, graphing the pressure 
trend, and checking usage statistics. 
Applications include R&D, coating, semi- 
conductor, analytical, high-energy phys- 
ics, general vacuum, and other applica- 
tions requiring long life and accuracy. 
MKS Instruments 

For info: 978-645-5500 
www.mksinst.com 


Ebola Antigens/Antibodies 

Several products have been designed 
for the development of an enzyme-linked 
immunosorbent assay (ELISA) detection 
assay for the Ebola virus. The new prod- 
ucts comprise three different monoclonal 
antibodies to the Ebola virus and two 
recombinant proteins (from the Sudan 
and Zaire strains) suitable for use in pro- 
ducing an optimized ELISA test kit for 
Ebola. An antigen-capture ELISA test for 
Ebola offers the possibility of diagnosis 
within the first few days after symptoms 
begin. Diagnosing Ebola can be a chal- 
lenge because the early symptoms are 
nonspecific to Ebola infection and are of- 
ten seen in patients with more commonly 
occurring diseases, such as malaria and 


typhoid fever. Accurate diagnostics are essential in controlling an 
outbreak. Ideally a test should be an easy-to-use and affordable 
solution that enables the rapid screening of suspected patients and 


people travelling out of an outbreak region. 


AMS Biotechnology 


For info: +44-(0)-1235-828200 


www.amsbio.com 


Electronically submit your new product description or product literature information! Go to www.sciencemag.org/products/newproducts.dtl for more information. 


Newly offered instrumentation, apparatus, and laboratory materials of interest to researchers in all disciplines in academic, industrial, and governmental organizations 
are featured in this space. Emphasis is given to purpose, chief characteristics, and availability of products and materials. Endorsement by Science or AAAS of any 
products or materials mentioned is not implied. Additional information may be obtained from the manufacturer or supplier. 
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NANJING UNIVERSITY 
OF TECHNOLOGY 


IMA AAT TE 
Overseas Talents Recruitment 


Nanjing Tech University, with a history of more than one 
hundred years, is a multidisciplinary university with a particular 
strength in engineering. 


Aiming at excellence and innovation, Nanjing Tech University is 
set to become a first-class research university with a global 
vision. We are now seeking outstanding academic and research 
leaders in the following and related fields: Basic disciplines from 
within the Physical Sciences; Cutting edge disciplines from 
within the Life Sciences; Applied disciplines from within the 
Information Sciences; Humanities represented by Management 
Science. 


Applicants should have a Ph.D. with at least 3-years research 
experience from leading universities or institutes. Candidates 
should demonstrate an internationally recognized research 
record and outstanding achievements. Successful candidates are 
expected to develop vigorous research programs and lead an 
independent research team. Successful candidates will be 
provided with a competitive relocation fee and salary package, 
generous start-up funds and spacious laboratories. 


Interested candidates should visit http://rezyb.njtech.edu.cn 
for application details. 

Phone: Ms. Wang +86-25-58139148. 

E-mail: job@njtech.edu.cn 
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NORTH CHINA ELECTRIC POWER UNIVERSITY 


Faculty Positions Available in North China 
Electric Power University, China 


Established in 1958, located in Beijing and Baoding (main campus in Beijing), North China 
Electric Power University (NCEPU) is one of the top research-oriented universities in China, 
focusing on fundamental cutting edge research and high-level education, covering diverse areas 
of science, engineering, technology, humanities, economics, management and law. NCEPU, as 
a key university jointly constructed by the Ministry of Education and the University Council, an 
organization composed of seven super-large Chinese power enterprises, China Electricity 
Council and NCEPU, is affiliated with the Ministry of Education, officially listed as one of the 
“211 Project” and “985 Project” universities as well as a “Predominant Discipline Innovation 
Platform”. It owns 1 national key laboratory, 2 state science and technology innovation 
platforms and 11 provincial and ministerial key laboratories. At present, the campus covers one 
million square meters in total, with about 3000 faculty and staff. 


North China Electric Power University is on a clear path to grow as a world-class university in 
engineering and science fields. As part of university’s further pursuit for excellence in research 
and education, we have expanded a global search for the best research talents to join us. 


North China Electric Power University invites applications for full-time Professors, Associate 
Professors and excellent scientists. Preference will be given to candidates whose research 
emphasis demonstrates the potential to complement and advance our existing research strengths 
on electric power, clean energy, energy power, mechanical engineering, automation, computer 
science, chemistry, economics, management, mathematics and physics. Successful candidates 
will be offered competitive salaries, appropriate positions and start-up funds. 


Position Openings 

Position offered by the Recruitment Program of Global Experts (1000 Plan Professorship) 
Position offered by Chang Jiang Scholars Program 

Position offered by the Recruitment Program of Global Young Experts (1000 Plan 
Professorship for Young Talents) 

eNorth China Electric Power University’s Professors 

eNorth China Electric Power University’s Associate Professors 

eNorth China Electric Power University’s Teachers 


Interested individuals should email his/her CV to ncepuscience@163.com, with “Faculty 
Application from Science” in the title. For more details, please visit our Talent Management 
Office’s website at http://rcb.ncepu.edu.cn, or contact us by email ncepuscience@163.com, or 
telephone 86-010-61772414. 


A 4 % 4 zz # Southwest Jiaotong University, P.R.China 
; na Anticipates Your Working Application 


Southwest Jactong University 


Southwest Jiaotong University (SWJTU), founded in 1896, situates itself in Chengdu, the provincial capital of Sichuan. 
It is a national key multidisciplinary “211” and “985 Feature” Projects university directly under the jurisdiction of the 
Ministry of Education, featuring engineering and a comprehensive range of study programs and research disciplines 
spreading across more than 20 faculties and institutes/centers. Boasting a complete Bachelor-Master-Doctor education 
system with more than 2,500 members of academic staff, our school also owns 2 first-level national key disciplines, 2 
supplementary first-level national key disciplines (in their establishment), 15 first-level doctoral programs, 43 first-level 
master programs, 75 key undergraduate programs, 10 post-doctoral stations and more than 40 key laboratories at national 
and provincial levels. 

Our university is currently implementing the strategy of “developing and strengthening the university by introducing and 
cultivating talents”. Therefore, we sincerely look forward to your working application. 

More information available at http://www.swjtu.edu.cn/ 

I. Positions and Requirem 

A.High-level Leading Talents 

It is required that candidates be listed in national top talents programs such as Program of Global Experts, Top Talents 
of National Special Support Program, “Chang Jiang Scholars”, China National Funds for Distinguished Young 
Scientists and National Award for Distinguished Teacher. 

Candidates are supposed to be no more than 50 years old. The limitation could be extended in the most-needed areas of 
disciplinary development. 

Candidates who work in high-level universities/institutes and reach the above requirements are supposed to be no more 
than 45 years old. 

B. Young Leading Scholars 

Candidates are supposed to be listed in or qualified to apply for the following programs: 

National Thousand Young Talents Program 

The Top Young Talents of National Special Support Program(Program for Supporting Top Young Talents) 
Science Foundation for the Excellent Youth Scholars 

Candidates should have good team spirit and leadership, outstanding academic achievements, broad academic vision and 
international cooperation experience and have the potential of being a leading academic researcher. 

C. Excellent Young Academic Backbones 

Candidates under 40 years old are expected to graduate from high-level universities/institutes either in China or other 
countries. Those who are professors, associate professors and other equal talents from high-level universities/institutes 
overseas could be employed as professors and associate professors as well. 

D. Excellent Doctors and Post Doctoral Fellows 

Candidates under 35 years old are supposed to be excellent academic researchers from high-level universities either in 
China or other countries. 

II. Treatments 

The candidates will be provided with competitive salaries and welfares that include settling-in allowance, subsidy of 
rental residence, start-up funds of scientific research, assistance in establishing scientific platform and research group as 
well as international-level training and promotion . As for outstanding returnees, we can offer further or specific 
treatments that can be discussed personally. 

III . Contact us: 

Contacts: Ye ZENG & Yinchuan LI Telephone number: 86-28-66366202 Email: talent@swjtu.edu.cn 
Address: Human Resources Department of SWJTU, the western park of high-tech zone, Chengdu, Sichuan, P.R.China, 
611756 


http://www.swjtu.edu.cn/ 


| ital 


High-level Talents Online Job Fair 2015 


The first action of 2015 
will be held on 24th May, from 7AM to 24PM in Beijing. 


Recruitment requirements: 

Scholars and doctors with working position and graduating doctors 
domestic and overseas 

Please send your curriculum vitae to: 

acabridge@gmail.com chisa.add@gmail.com chisa_yang@163.com 


For more information, please check the following web sites 


http://www.edu.cn/cv 
http://www.chisa.edu.cn/zt/yezt/network_video/index.html 


University Brief 

Founded in 1896, Xinxiang 
Medical University (XXMU) is a 
historic international research 
university system located on the 
north side of the Yellow River. 
The University comprises 15 
colleges/institutes on two main 
campuses, 5 affiliated hospitals 
(> 9,000 beds) in Henan, and 
1,080 faculty members. The 
University is renowned by both 
undergraduate and postgraduate 
education and emphasizing 
on interdisciplinary teaching 
and research worldwide. The 
research of University ranges 
from basic biomedical science to 
public health, pharmaceutical 
science, nursing, medical ad- 
ministration, psychology, _ life 
science, professional education, 
and clinical sciences. XXMU has 
established prolific international 
collaborations with Austria, Bri- 
tish, Malaysia, France, Germany, 
United States, Canada, and 
Japan. It promotes interdisciplinary 
studies, dedicated to educating 
all-rounded students to acquire a 
strong humanitarian and_in- 
novative thinking with a global 
outlook. 


Positions 


Positions are open~in_ basi 


om rok “Tai H 
biomedicines, clinical medicines 


stomatology, public health, 1 _mo- 
lecular and cellular 


XINXIANG MEDICAL UNIVERSITY 
RECRUITMENT OF PRINCIPAL INVESTIGATORS 


other related professional fields. 
In particular, the leading dis- 
ciplines of the University are 
neuroscience, psychology, cancer 
research, and immunology. The 
following applicants are en- 
couraged: assistant professors, 
associate professors, and full pro- 
fessors oversea; academicians of 


the Chinese Academy of Science 
and the Chinese Academy of 
Engineering; national candidates 
of “the Thousands of People 
Plan”, national candidates of “the 
Special Support Plan”; professors 
awarded with “The Yangtze 
Scholar”; awardees of “The 
National Science Fund for Dis 
tinguished Young Scholars”; 
chief principal investigators _of 
“The 863 and 973 Projects”; and 
awardees of “The Zhong Yuan 


Scholars”.Appointed prof 

will be qualified to spel 
“Henan Talent Professors” and 
ang Scholars”. 
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2.Start-up fund, 3-50million RMB 
Yuan. 

3.Provide one house with living 
area of 80-120 m. 
4.Independent PI is encouraged 
to recruit his/her own team 
members who will be supported 
by additional funds. 

5.The University offers appropriate 
position for family members. 
6.The University provides re- 
tirement plan and_ medical 
insurance at the national level. 
7.Aforementioned'§ salary and 
benefits are negotiable for the 
applicants who will work as 
part-time hires. 


Application Procedure 
Interested candidates are invited 
to send: 

1) a research plan for the next 5- 
10 years; 

2) full CV with degree certifications 
and web link; 

3) three letters of recommendation ; 
4) three to five publications and/or 
patents with a description of 
your academic contributions to 
Mr. Yu or Mr. Gu, 601 Jinsui Avenue, 
Xinxiang, Henan, 453003. 
+(86)373-302-9007 (or 9163), 
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NX) Wake Forest” 
School of Medicine 


The Department of Neurobiology and Anatomy at Wake Forest 
Medical School announces open positions for 


Postdoctoral/Predoctoral Training in 
Multisensory Processes 


We seek strong candidates for postdoctoral and predoctoral training 
funded by an NIH T32 Training Grant. The training program 
provides a rich collaborative research environment that fosters 
interdisciplinary approaches to understanding how the brain 
integrates information from multiple senses to produce perception 
and adaptive behavior. Candidates with direct experience as well 
as those in related fields are encouraged to apply. Trainees will 
have access to any of 10 laboratories using human subjects and/ 
or a variety of animal models (rodents-primates) with approaches 
spanning molecular/cellular to perceptual/behavioral. Fellowships 
are awarded on a competitive basis. 


Applications including a current curriculum vitae or nominations 
should be sent to the Training Grant Director: Dr. Barry E. Stein 
bestein@wakehealth.edu, or to its Co-Directors: Dr. Terrence 
Stanford (stanford@wakehealth.edu) and Dr. Dwayne Godwin 
(dgodwin@wakehealth.edu). A description of the faculty and the 
program can be accessed via the website: http://graduate.wfu.edu/ 
admissions/t32/training_tpmp.html 


Wake Forest School of Medicine is an Affirmative Action/Equal 
Opportunity Employer and especially encourages applications 
from women and minority candidates. 


~ 7 
STANFORD 
SCHOOL OF MEDICINE 
Stanfard University Medical Center 
Stanford University School of Medicine seeks to recruit a new faculty member 
to join the Institute of Stem Cell Biology and Regenerative Medicine as an 
Assistant Professor in the University Tenure Line. There will be a co-primary 
appointment in an appropriate department to be considered in conjunction with 
the successful applicant. Candidates must have a Ph.D. in computer science, 
electrical engineering, bioinformatics or a related field. The search committee 
encourages applications from candidates who have research interests in clinical 
or translational informatics, bioinformatics, or related subdisciplines, and who 
have or will develop an independent research program in the computational 
sciences. 


¢ The predominant criterion for appointment in the University Tenure Line 
is a major commitment to research and teaching. 


The successful candidate will conduct a vigorous, independent research program, 
teach in the classroom, and advise students in any of a number of graduate 
programs in computational sciences. 

Faculty members appointed as Assistant Professors will have completed one 
or two years of postdoctoral research experience. Their accomplishments 
during graduate and postgraduate training should already have stamped them 
as creative and promising investigators. Preference will be given to candidates 
who have experience in developing novel analytical tools to analyze the cellular 
composition of organs and/or tissues and who can complement and enhance 
existing strengths in stem cell biology, cancer biology, developmental biology, 
bioinformatics, and bioengineering within the Institute and the University. 


Submit CV, a brief statement of research objectives and the names of three 
references to: 
Dr. Michael F. Clarke 
Chair, Stem Cell Computation Search Committee 
Institute for Stem Cell Biology and Regenerative Medicine 
Email: stemcellcomputationsearch@stanford.edu 


Stanford University is an Equal Opportunity Employer and is committed 

to increasing the diversity of its faculty. It welcomes nominations of and 
applications from women and members of minority groups, protected veterans 
and individuals with disabilities, as well as others who would bring additional 

dimensions to the university 5 research, teaching and clinical missions. 
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= Exercises to help you examine your skills, interests, and values 
" Alist of 20 scientific career paths with a prediction of which ones best fit your skills and interests 
« Atool for setting strategic goals for the coming year, with optional reminders to keep you on track 
" Articles and resources to guide you through the process 
" Options to save materials online and print them for further review and discussion 

Ability to select which portion of your IDP you wish to share with advisors, mentors, or others 


societies and the NIH " Acertificate of completion for users that finish myIDP. 


Visit the website and start planning today! 
myIDP.sciencecareers.org 
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THE ONSAGER FELLOWSHIPS 


12 tenure-track positions available at NINU 


The Norwegian University of Science and Technology (NTNU) is Norway's primary 
institution for educating the future’s engineers and scientists. The university also 
has strong programmes in the social sciences, teacher education, the arts and 
humanities, medicine, architecture and fine art. NTNU’s cross-disciplinary 
research delivers creative innovations that have far-reaching social and economic 
impact and that help contribute to a better world. 


The Onsager Fellowship programme at NTNU is designed to attract the most 
talented scholars with an established reputation for high quality research and a 


commitment to learning and teaching at the university level. 


APPLY FOR A TENURE-TRACK POSITION AS AN ASSOCIATE PROFESSOR IN: 

e Linguistics e Safety and reliability of complex systems 

© Robotic vision e Marine structures for the future - marine technology 
e Molecular biodiversity e Zero emission refurbishment of the built environment 


e Medicine - bioinformatics ¢ Economics of natural resources and quantitative 


e Medicine - molecular biology pete regen 


e Statistical machine learning 


¢ Theoretical condensed matter physics More info at: www.ntnu.edu/onsagerfellowship 
e Inorganic or hybrid functional materials Closing date: 25 May. 


NTNU - Trondheim 


Norwegian University of TOP RESEARCH DEMANDS BRILLIANT MINDS 
Science and Technology - WE'RE ALWAYS LOOKING FOR THE BEST 
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Reserve space by May 26* 


FHERE’*S- A SCIENCE-FO- REACHING SCIENTISTS. 


What makes Science the best choice? 
Read and respected by 570,400 readers around the globe 
78% of readers read Science more often than any other journal 
Your ad sits on specially labeled pages to draw attention to the ad 


Your ad dollars support AAAS and its programs, which strengthens the global 
scientific community. 


Why choose this biotechnology section for your advertisement? 
Relevant ads lead off the career section with special Biotechnology banner 
Bonus distribution to: 


BIO International Convention 
June 15-18, 2015, Philadelphia, PA 


BIO Career Fair 
June 18, 2015, Philadelphia, PA. 


Expand your exposure. Post your print ad online to benefit from: - 
Link on the job board homepage directly to biotechnology jobs Science 


Dedicated landing page for jobs in biotechnology 
Additional marketing driving relevant job seekers to the job board. 


* Ads accepted until June 8 on a first-come, first-served basis. 


SCIENCECAREERS.ORG 
To book your ad: advertise@sciencecareers.org 
The Americas Japan 
202-326-6582 +81-3-3219-5777 
Europe/RoW 


China/Korea/Singapore/Taiwan 
FROM THE JOURNAL SCIENCE JAVAAAS +44 (0)1223-326500 


+86-186-0082-9345 


FUNDING OPPORTUNITIES — U.S. Department of Defense 


Defense Medical Research and Development Program 


Peer Reviewed Medical Research Program 


The Peer Reviewed Medical Research Program (PRMRP) funds exceptional research with the goal to improve the health and well-being of all military 
Service Members, Veterans, and their beneficiaries. The PRMRP received $247.5 million in fiscal year 2015 (FY15) and seeks grant applications in 
the following topic areas: 
Acupuncture DNA vaccine technology for Integrative medicine Post-traumatic osteoarthritis 
Acute lung injury post-exposure prophylaxis Interstitial cystitis Psychotropic medications 
Advanced prosthetics Dystonia Lupus Pulmonary fibrosis 
Arthritis Focal segmental glomerulosclerosis Malaria Respiratory health 
Burn pit exposure Food allergies Metals toxicology Rheumatoid arthritis 
Cardiovascular health Fragile X syndrome Mitochondrial disease Scleroderma 
Chronic migraine and Healthcare-acquired infection reduction — Nanomaterials for bone regeneration Sleep disorders 
post-traumatic headache Hepatitis B Osteoarthritis Tinnitus 
Congenital heart disease Hereditary angioedema Pancreatitis Vascular malformations 


Dengue Hydrocephalus Pathogen-inactivated dried plasma Women’s heart disease 
Diabetes Inflammatory bowel disease Polycystic kidney disease 


Descriptions of the FY15 PRMRP Program Announcements and * Clinical Trial Award ° Investigator-Initiated Research Award 
General Application Instructions are anticipated to be posted on * Discovery Award * Technology/Therapeutic Development Award 
Grants.gov by mid-April 2015: * Focused Program Award 


All applications must conform to the Program Announcements and General Application Instructions that will be available for electronic downloading from the Grants.gov website 
(all viewable under CFDA number 12.420). Execution management support will be provided by the Congressionally Directed Medical Research Programs. 


http://cdmrp.army.mil 
http://cdmrp.army.mil/funding/prmrp.shtml 


Director, Materials Research PARTOUT OUILY ADE L’AVENIR 


Laboratory & Professor 
College of Engineering 
University of Illinois at Urbana-Champaign 


The College of Engineering at the University of Illinois at Urbana- 
Champaign invites applications for the position of Director of the 
Frederick Seitz Materials Research Laboratory (MRL). The MRL is a 
state-of-the-art research facility that fosters interdisciplinary research 
at the forefront of materials science. MRL brings together world-class 
faculty and students in condensed matter physics, materials chemistry, 
and materials science in a highly collaborative research environment. 
Participants in the MRL come from many departments across the 
College of Engineering and the campus. For more information on the 
MRL, please visit their website at http://mrl.illinois.edu/. 


This position provides overall leadership and a long-term vision to 
leverage the capabilities of the MRL. Candidates must be eligible to 
hold a tenured full Professor position within the College of Engineering 
at the University of Illinois. The desired start date for this position is 
as soon as possible. To ensure full consideration, applications should 
be received by September 1, 2015, but applications will be accepted << 
until the position is filled. Please visit http://jobs.illinois.edu to view — 
the complete position announcement and application instructions. 
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environment (http://provost. illinois. edu/worklife/index. htm). 1° institut européen pour les sciences de l’animal et du végétal 
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POSITIONS OPEN 
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FACULTY POSITIONS-MEDICAL SCHOOL 0 i, Ce lal 
The Saint James School of Medicine, an international 
medical school (website: http://www.sjsm.org), 
invites applications from candidates with teaching 
and/or research experience in any of the basic medical 
sciences for its Caribbean campuses. Faculty positions 
are currently available in Pathology, Histology, and 
Anatomy. Applicants must be M.D., D.O., and/or Ph.D. 
Teaching experience in the U.S. system is desirable but 
not required. Retired persons are encouraged to apply. 
Attractive salary and benefits. Submit curriculum vitae 
electronically to e-mail: jobs@mail.sjsm.org or online 
at website: http://www.sjsm.org. 
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Call for Application 
JAXA International Top Young 
Fellowship (ITYF) 2015 


ITYF application 


The Japan Aerospace Exploration Agency 
(JAXA) International Top Young Fellowship 
(ITYF) invites world’s top-level young 
researchers to work at the Institute of Space 
and Astronautical Sciences (ISAS)/JAXA for 
3 years. 


In addition to producing outstanding academic 
achievements in their areas of specialization, 
ITYF fellows are expected to contribute to 
creating new academic trends in collaboration 
with researchers inside and/or outside 
ISAS. 


An excellent remuneration package is offered, 
including research budget (including travel 
expense) so that the fellow can extend their 
international profile, as well as developing 
collaborations within Japan. 

The application deadline: 17:00 (JST) 

June 19, 2015 

Please see the below link for further details. 
http://www.isas.jaxa.jp/e/researchers/ 
young-fellowship/appli.shtml 
Contact Information 
E-mail: ITYF2015@jaxa.jp 


LUICONN | SCHOOL OF MEDICINE 


Tenure-Track Faculty Position in Neuroscience 


The Department of Neuroscience at the University of Connecticut Health Center seeks an outstanding inves- 
tigator for a tenure-track position at the Assistant or Associate Professor rank working in the field of Vascular 
Dementia, General Dementia or Neuroimmunology. We are seeking highly motivated individuals with demon- 
strated potential for creativity and academic excellence that have been successful in securing extramural 
funding. In this first of several new positions, we are specifically seek an investigator who examines the inter- 
action between the nervous and immune systems in both normal and abnormal cognitive states, especially 
in conditions that lead to dementia. A focus on the cerebral vasculature is desirable, but applicants with 
expertise in other related areas will be considered. 


The ideal candidate’s research program should include cellular as well as in vivo models with a specific interest 
in translational studies. The candidate will participate in a vibrant graduate student (Ph.D.) and dual degree 
(M.D./Ph.D. and D.M.D./Ph.D.) training program, and will be involved in teaching and mentoring at the 
University of Connecticut Health Center. The applicant will have access to a growing translational research 
community and an expanding regional scientific community. The University of Connecticut Health Center is 
engaged in a major expansion of its research programs as part of several transformative state initiatives. The 
Bioscience Connecticut initiative will expand the University of Connecticut Health Center health care offerings 
and research capabilities. Other initiatives include the Next Generation Connecticut initiative at the University 
of Connecticut Storrs campus, which will lead to expansion and collaboration between researchers throughout 
the University system including enhanced human MRI capabilities. There is also a large and active basic neu- 
roscience community at both the Farmington and Storrs sites. The University of Connecticut Health Center 
has established areas of research excellence in Aging, Immunology, Psychiatry, Vascular Biology, and Stroke 
which will be major areas of potential collaboration. In addition, there is the opportunity to collaborate with 
researchers at the newly built Jackson Laboratory for Genomic Medicine, also on The University’s Health 
Center campus. Salary and start-up funds are competitive and many outstanding core facilities are available. 


Minimum qualifications include a Ph.D. and/or M.D. Both established individuals with a history of sustained 
extramural funding and publications and Assistant Professor level applicants completing training or transi- 
tional awards (i.e., KO8, K23, K99) will be considered. 


Applicants should apply at https://jobs.uchc.edu, search number 2015-824, and upload a CV, cover letter 
and concise statements of research and teaching interests, and at least three letters of reference on letter- 
head with signature. To ensure full consideration, applications should be received by June 1, 2015. 


UConn Health is an Equal Opportunity Employer M/F/V/PwD 
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Faculty Position 
Division of Hematology, Department 
of Medicine, Keck School of Medicine 


Assistant or Associate 
Professor: 
Head and Neck Cancer 
Department of Otolaryngology 
Harvard Medical School 


And Division of Head and Neck Oncolozy The USC Division of Hematology and Center 


Massachusetts Eye and Ear 


The Department of Otolaryngology at the Harvard 
Medical School and the Massachusetts Eye and 
Ear (MEE) seek applications from researchers 
interested in head and neck cancer. The MEE 
provides an interdisciplinary and collaborative 
intellectual environment within an active clinical 
setting. The research base in our Department 
includes 30 NIH-funded investigators studying 
diverse aspects of hearing, balance, smell, facial 
movement, and their disorders. MEE researchers 
participate in a variety of teaching programs for 
graduate students, medical students and residents, 
at both Harvard and MIT. 


Candidates should have an outstanding record 
of research accomplishment and a strong desire 
for research collaboration. Applications will be 
accepted until the position is filled, however 
initial evaluation will take place by June 1, 
2015. Interested candidates should send a CV 
and Statement of Research Interests to: Derrick 
T. Lin, M.D., FACS, Chief of Head and Neck 
Oncology, Department of Otolaryngology, 
Massachusetts Eye and Ear, 243 Charles Street, 
Boston, MA 02114. 

The Massachusetts Eye and Ear Infirmary and 
Harvard Medical School are Equal Opportunity/ 
Affirmative Action Employers. Women and 
minorities are encouraged to apply. 


for the Study of Blood Diseases seeks faculty 
members interested in research in novel 
treatment approaches for blood cancers, 
including chimeric antigen receptors, synthetic 
biology, genome engineering, induced 
hematopoietic stem cells, allogeneic stem 
cell graft engineering, adoptive cell therapy, 
TCR gene therapy, bispecific antibodies, 
antibody drug conjugates, and molecularly 
targeted drugs. Applicants with MD or MD/ 
PhD, BC/BE in Hematology and/or Oncology 
and eligible for a CA medical license are 
preferred. Exceptional PhD candidates 
will also be considered. We offer highly 
competitive compensation and appointment at 
the appropriate academic level commensurate 
with experience. 


USC values diversity and is committed to 
equal opportunity in employment. Members 
of all racial, gender and ethnic groups are 
encouraged to apply. 


Please send letter of interest, CV and names of 
3 references to Preet Chaudhary, MD, PhD, 
Chief, Nohl Division of Hematology, Keck 
School of Medicine of USC, 1441 Eastlake 
Avenue, NOR 3470, MC 9172, Los Angeles, 
CA 90033. Email: Preet.Chaudhary@med. 
usc.edu. 
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The making of a science evangelist 


have wanted to be a scientist ever since I was a little girl. I got the idea from a television program 
called 3-2-1 Contact, where I watched a young African-American girl solve problems. I saw my 
reflection in her and was transfixed. As time passed and my science career progressed, I saw that 
reflection less often. Now, years later, after a stint in industry and a negative tenure decision, 
I’m putting myself out there so that others can see their reflection in me. It’s a precarious path, 
exposed and vulnerable, but so far the fruits of my journey have exceeded my fears. 


Great teachers nurtured my inter- 
est in science. There was Kathleen 
Donahue in fifth grade, who stoked 
my passion. There was Jean Howard, 
the unflappable high school physics 
teacher who urged me to apply to 
top colleges. 

At Brown University, there were 
Edelgard Morse, the chemistry 
professor who rescued me when 
my grades fell, and also Anne 
Fausto-Sterling—my learned men- 
tor, professor, and North Star. With 
a strong sense of purpose, a hungry 
mind, and a willingness to work 
hard, I assumed my success was as- 
sured. It wasn’t; I could easily have 
been one of the many nameless casu- 
alties science routinely sheds. Weary 
and battle worn, I made it through. 

I continued on to Stanford Uni- 
versity to seek a Ph.D. As the only black student in the mate- 
rials science department, I felt lonely and lost. But two good 
things happened there: I was awarded a AAAS Mass Media 
Fellowship and spent a summer writing for Time. (AAAS is 
the publisher of Science.) And I taught an introductory mate- 
rials science class at a local community college. I was smitten 
with science communication. My students were smitten with 
science. I learned that I loved teaching science and that I 
have a talent for reaching apprehensive students. There was 
a third good thing: I finished my degree. 

I took a research position at Bell Labs. There, I was in 
constant contact with other minority scientists. I saw my 
reflection again. My creativity flourished. I was awarded a 
spot on MIT Technology Review's Innovators Under 35 list. 
I wanted to stay forever, but the telecom bubble burst, and 
my job ended. 

So I landed in a tenure-track faculty position at Yale Uni- 
versity. I established a well-funded lab, earned top teach- 
ing scores, and maintained a respected research program. I 
hosted a fun lecture series for kids called Science Saturdays. 
Children loved it and I did, too. In fact, I loved it more than 


“T saw my reflection in her 
and was transfixed.” 


I loved my research. Faculty life 
was not ideal—I could not see my 
reflection in my colleagues—but it 
was workable and I hoped to stay. 
Fate, and the tenure committee, had 
other plans. 

The denial of tenure thrusts you 
into a caste of the unclean. You're 
expected to pick up your book bag, 
your lumbar pillow, and your shred- 
ded dignity and sneak meekly out 
the back door. By the time I left, I 
was happy to go. My gut told me it 
could take no more, that academic 
science wasn’t my path. I was ill, 
fatigued, and uninterested in climb- 
ing that ladder again. 

I used the tenure decision as an 
opportunity to explore afresh what 
I should do with my “one wild and 
precious life,” as Mary Oliver called 
it in her poem, The Summer Day. It was time for me to dis- 
cover my true vocation, which Aristotle supposedly said 
could be found at the intersection of the world’s needs and 
my passions. 

By now, I knew my passions well enough: science, teach- 
ing, sharing my love for science. So I looked at the world’s 
needs and figured out my vocation. In the time I had left at 
Yale, I produced some science videos and posted them on 
YouTube. It felt good. Things opened up. TED invited me 
to give a talk. Dignitaries invited me to meetings. Random 
House published my book. 

I’m now happier and healthier. I’m following a path worn 
by Isaac Asimov, Carl Sagan, Neil deGrasse Tyson, and Bill 
Nye: lecturing, writing, and publicizing science. My new 
vocation offers less security than the old one did, but that 
seems fitting somehow. It’s wilder and more precious. & 


Ainissa Ramirez is the author of two books and co-hosts 
the science podcast Science Underground. For more on life 
and careers, visit ScienceCareers.org. Send your story to 
SciCareerEditor @aaas.org. 
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